Transduction of recombinases for inducible gene targeting

The present invention provides the use of a fusion protein comprising a 
site-specific DNA recombinase domain and a protein transduction domain 
for preparing an agent for inducing target gene alteration in a living 
organism or in cultured cells, suitable fusion proteins and a method for the 
production of said fusion proteins. 

Background 

For some years targeted mutagenesis in totipotent mouse embryonic stem
(ES) cells has been used to inactivate genes for which cloned sequences
were available (Capecchi, Trends in Genetics 5, 70 - 76 (1989)). Since ES
cells can pass mutations induced in vitro to transgenic offspring in vivo, it
is possible to analyze the consequences of gene disruption in the context
of the entire organism. Thus, numerous mouse strains with functionally 
inactivated genes ("knock out mice") have been created by this 
technology and utilized to study the biological function of a variety of 
genes. 

A refined method of targeted mutagenesis, referred to as conditional 
mutagenesis, employs a site-specific recombination system (e.g. Cre/loxP 
or Flp/frt - Sauer and Henderson, N. Proc. Natl. Acad. Sci. USA 85, 5166-5170
(1988); Senecoff et al., J. Mol. Biol., 201, 405 - 421 (1988)) which
enables a temporally and/or spatially restricted alteration of target genes 
(Rajewsky et al., J. Clin. Invest., 98, 600 - 603 (1996)). The creation of 
conditional mouse mutants requires the generation of two mouse strains,
i.e. the recombinase recognition strain and the recombinase expressing
strain. The recombinase recognition strain is generated by homologous
recombination in ES cells as described above except that the targeted
exon(s) is (are) flanked by two recombinase recognition sequences
(hereinafter "RRS"; e.g. loxP or frt). The type of recombination event
mediated by the recombinase depends on the disposition of the RRS, with
deletions, inversions, translocations and integrations being possible
(Torres and Kuhn, Oxford University Press, Oxford, New York (1997)). By
placing the RRS into introns, an interference with gene expression before 
recombination can be avoided. The recombinase expressing strain 
contains a recombinase transgene (e.g. Cre, Flp) whose expression is 
either restricted to certain cells and tissues or is inducible by external
agents. Crossing the recombinase recognition strain with the
recombinase expressing strain leads to recombination of the RRS-flanked
exons in the doubly transgenic offspring in a prespecified temporally and/or
spatially restricted manner. Thus, the method allows the temporal analysis
of gene function in particular cells and tissues of otherwise widely
expressed genes. Moreover, it enables the analysis of gene function in the
adult organism by circumventing embryonic lethality, which is frequently
the consequence of gene mutation. For pharmaceutical research aiming
to validate the utility of genes and their products as targets for drug
development, inducible mutations provide an excellent genetic tool.
However, the current systems for inducible recombinase expression in
transgenic animals suffer from a certain degree of leakiness in the
absence of the inducer (Kuhn et al., Science 269(5229): 1427-9 (1995);
Schwenk et al., Nucleic Acids Res. 26(6): 1427-32 (1998)). Furthermore,
the generation of conditional mutants is a time-consuming and
labor-intensive procedure, since the recombinase recognition strain and the
recombinase expressing strain have to be bred over at least two
generations in order to obtain animals carrying both the recombinase
transgene and two copies of the RRS-flanked target gene sequence.

Protein transduction domains (hereinafter referred to as "PTD") that
have the ability to cross cell membranes were identified, e.g. in the
Antennapedia protein from Drosophila (Vives et al., J. Biol. Chem.
272(25): 16010-7 (1997)), Kaposi fibroblast growth factor (Kaposi FGF;
Lin et al., J. Biol. Chem. 270: 14255-58 (1995)), VP22 from HSV (Elliott
and O'Hare, Cell, 88(2): 223-33 (1997)) and TAT from HIV (Green and
Loewenstein, Cell, 55(6): 1179-88 (1988); Frankel and Pabo, Cell,
55(6): 1189-93 (1988)). WO 99/29721 moreover mentions TAT mutants
having an enhanced activity as compared to the wild-type peptide.
Fusion of PTDs to heterologous proteins conferred the ability to transduce
into cultured cells (Fawell et al., Proc. Natl. Acad. Sci. USA, 91(2): 664-8
(1994); Elliott and O'Hare (1997); Phelan et al., Nature Biotech. 16: 440-443
(1998) and Dilber et al., Gene Ther., 6(1): 12-21 (1999)). Dalby and
Bennett showed that a fusion protein consisting of VP22 and functional Flp
recombinase translocated between cells in culture (from COS-1 cells
transfected with VP22-Flp to CHO cells carrying Flp recognition sites (FRT
sites); see Dalby and Bennett, Invitrogen, Expressions 6.2, page 13
(1999)). Further, WO 99/11809 mentions a fusion protein Antp-Cre and
emphasizes that it may be used to deliver Cre into the cell, where it
recombines inside the cell nucleus. It is mentioned that the fusion protein
is suitable for manipulating genomic DNA at precise locations in a
temporally regulated manner.

Furthermore, a recent report demonstrated that the β-galactosidase
protein fused to the 11 amino acid PTD from the HIV TAT protein can
infiltrate all tissues of living mice, reaching every single cell (Schwarze et
al., Science, 285(5433): 1569-72 (1999)). Finally, WO 99/60142 discloses
vector constructs for gene therapy carrying a tumor cell sensitizing gene, 
a sensitizing gene expression regulatory system, a control gene and a 
control gene expression regulatory system, wherein the control gene can 
be a fusion gene consisting of a recombinase (viz. Cre or Flp) and a 
trafficking protein (viz. VP22). 



WO 01/49832 PCT/EP01/00060

With regard to the fusion protein Antp-Cre of WO 99/11809, it is, however,
general knowledge in the art that the Antennapedia PTD is not a generally
applicable transducing protein, namely it has only a limited activity with
proteins having more than 100 amino acid residues (Derossi et al., Trends
Cell Biol. 8: 84-87, 1998). In view of the limited transducing activity of the
Antp PTD and the size of the generally known recombinases (ranging from
about 200 to about 600 amino acid residues), it was desirable to provide a
more potent system for the transduction of recombinases. It was,
however, not clear to a person skilled in the art whether PTDs would be
effective at all with recombinases, for the following reasons:

(i) only a single example of PTD-mediated delivery of proteins (above 100
amino acid residues) in vivo has been reported so far (Schwarze et al.,
Science, 285(5433): 1569-72 (1999); Fawell et al., PNAS, 91: 664-68
(1994); both references describing the TAT-mediated transduction of
β-galactosidase in mice);

(ii) it is known that, due to unfolding and refolding processes, the
transduction of native proteins into cells may result in a significant loss of
protein activity (e.g., as described for TAT-GFP; Schwarze et al., Trends
Cell Biol. 10: 290-95 (2000));

(iii) neither the number of protein molecules that can be transferred into a
cell by a given translocation domain nor the number of Cre molecules in
the cell nucleus that is required for efficient recombination has been
systematically determined;

(iv) the delivery of active proteins requires unfolding and proper refolding,
which is unpredictable for a given protein (Bonifaci et al., AIDS 9:
995-1000 (1995)); and

(v) the mechanism by which protein transduction domains facilitate
protein transduction is unknown, and several findings have been published
that rule out classical receptor-, transporter-, endosome- or
endocytosis-mediated processes in the transduction of Antp, TAT and VP22
(G. Elliott, P. O'Hare, Cell 88, 223-233 (1997); D.A. Mann, A.D. Frankel,
EMBO J. 10, 1733-1739 (1991); D. Derossi et al., J. Biol. Chem. 269,
10444-10450 (1994); D. Derossi et al., J. Biol. Chem. 271, 18188-18193
(1996); E. Vives et al., J. Biol. Chem. 272, 16010-16017 (1997)).

Moreover, there was still the need for a generally applicable method in which
the genetic manipulation can be performed on both endogenous genes
and transgenes.

Summary of the Invention 

It was found that site-specific DNA recombinase proteins can be
translocated into cells of a living organism when fused to specific protein
transduction domains, namely transduction domains derived from
the VP22 protein of HSV or from the TAT protein of HIV. Thus, whenever a
gene mutation is desired, recombination is induced upon the injection of
the appropriate site-specific recombinase fused to a transduction domain
into such a living organism (provided, however, that said organism carries
at least one appropriate RRS integrated in the genome).

The present invention thus provides

(1) the use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living
organism or cell culture, wherein said living organism carries at least one
or more recognition sites for said site-specific DNA recombinase integrated
in its genome;

(2) a method for inducing gene alterations in a living organism which
comprises administering to said living organism a fusion protein
comprising a site-specific DNA recombinase domain and a PTD as defined
in (1) above, wherein said living organism carries at least one or more
recognition sites for said site-specific DNA recombinase integrated in its
genome;

(3) a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a PTD being derived from the VP22 protein of HSV or from the TAT 
protein of HIV 

provided that when the site-specific DNA recombinase domain is wild-type
Cre or Flp then the PTD is not the full length VP22 PTD of HSV (i.e., the
fusion protein is not identical to the fusion protein of Dalby and Bennett,
Invitrogen, Expressions 6.2, page 13 (1999) and of WO 99/60142);

(4) a DNA sequence coding for the fusion protein of (3) above; 

(5) a vector comprising the DNA sequence as defined in (4) above;

(6) a host cell transformed with the vector of (5) above and/or comprising 
the DNA of (4) above; 

(7) a method for producing the fusion protein of (1) above which
comprises culturing the transformed host cell of (6) above and isolating
the fusion protein; and

(8) an injectable composition comprising the fusion protein as defined in
(1) or (3) above.

The invention is further illustrated by the appended Figures and is
explained in detail below.

Description of the Figures 

Fig. 1: Generation of induced mouse mutants using purified fusion
proteins. 

A: Expression of the fusion protein consisting of the site-specific DNA 
recombinase (e.g. Cre) and the protein transduction domain (e.g. the HIV 
derived TAT peptide) in prokaryotic or eukaryotic cells. 
B: Extraction and purification of the expressed fusion protein (e.g. as
described in Nagahara et al., Nat. Med. 4(12): 1449-52 (1998)).
C: Injection of the purified fusion protein into mice carrying the
RRS-flanked target sequence.
D: Analysis of the pattern of induced target gene recombination and the
resulting phenotype.
Triangle: RRS. 

Fig. 2: Scheme of the bacterial expression vector pT7-TACS (SEQ ID 
NO: 16). The coding region of the 11 amino acid protein transduction 
domain of HIV TAT protein is fused to the N-terminus of the Cre 
recombinase protein sequence. The 10-amino-acid strep tag and the
protease factor Xa recognition sequence are fused to the C-terminus. The
T7 promoter permits expression of TAT-Cre protein in E. coli.

Fig. 3: Detection of purified TAT-Cre protein by Coomassie staining and
Western blot analysis.

A: Coomassie stained SDS-PAGE gel. Lane 1: 10 kDa ladder (Life
Technologies, Cat. No.: 10064-012), 2: 1000 ng BSA, 3: 750 ng BSA, 4:
500 ng BSA, 5: 100 ng BSA, 6: 50 ng BSA, 7: 5 µl TAT-Cre, 8: 1 µl
TAT-Cre in Bicine buffer.

B: Western blot analysis using an alkaline phosphatase-conjugated
anti-strep tag antibody (IBA, Cat. No: 2-1503-001). Lane 1: MultiMark
(Invitrogen, Cat. No.: LC5725), 2: 7 µl TAT-Cre, 3: 5 µl TAT-Cre, 4: 2.5 µl
TAT-Cre, 5: 1.25 µl TAT-Cre in Bicine buffer.

Fig. 4: X-Gal staining of M5Pax8 cells treated with TAT-Cre protein.
M5Pax8 fibroblasts were treated for 18 h with 3.5 (A), 6.9 (B) and 13.8
µg/ml TAT-Cre protein (C) in serum-free medium. Four days after
treatment, cells were fixed and stained with X-Gal.

Fig. 5: Measurement of β-galactosidase activity in cell lysates. M5Pax8
fibroblasts were treated for 18 h with increasing concentrations of
TAT-Cre, as indicated, or transiently transfected with expression vectors
for either Cre (pCMV-I-Cre-pA, see SEQ ID NO: 29) or β-galactosidase
(pCMV-I-β-pA, see SEQ ID NO: 30). Four days after treatment, cells were
lysed and the β-galactosidase activities were determined.

Fig. 6: PCR detection of TAT-Cre mediated recombination in mice.
A: PCR analysis of genomic DNA from duodenum (lane 2), liver (3),
kidney (4), spleen (5), muscle (6), lung (7), tail (8) and brain (9) of a
plnl3 mouse treated three times with intraperitoneal injections of 75 µg
TAT-Cre protein at two-day intervals. Deletion of the loxP-flanked DNA
segment is indicated by the presence of the about 400 bp fragment. Lane
1: 1 kb ladder (Life Technologies).

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA
segment. Arrows indicate the positions of the primers.
C: PCR analysis of genomic DNA from spleen of a plnl3 mouse treated
three times with intraperitoneal injections of 75 µg TAT-Cre protein at
two-day intervals (lane 4). To confirm the presence of the BamH I restriction
site, the PCR product was digested with BamH I, which produces two
diagnostic fragments of about 190 and about 210 bp (5). As a control, tail
DNA from untreated mice carrying the loxP-flanked (lane 2) and the
deleted plnl3 allele (3) was subjected to PCR amplification. Lane 1: 100
bp ladder (Life Technologies), lane 6: 1 kb ladder (Life Technologies).

Fig. 7: Scheme of the bacterial expression vectors pT7-VPCS (SEQ ID
NO: 17) and pCRT7-ΔVPCS (SEQ ID NO: 15). The coding region of the 301
amino acid protein transduction domain of HSV VP22 protein (A) or the
truncated 143 amino acid ΔVP22 domain (B) is fused to the N-terminus of
the Cre recombinase protein sequence. The 10-amino-acid strep tag and
the protease factor Xa recognition sequence are fused to the C-terminus.
The T7 promoter allows the expression of VP22-Cre and ΔVP22-Cre fusion
proteins in E. coli. The sequence in pCRT7-ΔVPCS encoding the 15 amino
acid N-terminal leader sequence is used for enhanced protein stability
(Invitrogen).

Fig. 8: Detection of the purified VP22-Cre and ΔVP22-Cre fusion proteins
by Coomassie staining and Western blot analysis.
A: Detection of VP22-Cre protein in a Coomassie-stained SDS-PAGE gel.
Lane 1: 10 kDa ladder, 2: 1000 ng BSA, 3: 500 ng BSA, 4: 100 ng BSA,
5: inclusion body protein extract before chromatography, 6: unbound
protein, 7: fraction 17, 8: fraction 18, 9: fraction 19, 10: fraction 20. The
position of the 75 kDa VP22-Cre protein is indicated by the arrowhead.
B: Detection of VP22-Cre protein by Western blot analysis using an
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.:
2-1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein
extract before chromatography, 3: unbound protein, 4: fraction 10, 5:
fraction 11, 5: fraction 16, 6: fraction 17, 7: fraction 18, 8: fraction 19, 9:
fraction 19, 10: fraction 20.

C: Detection of ΔVP22-Cre protein in a Coomassie-stained SDS-PAGE gel.
Lane 1: 10 kDa ladder, 2: inclusion body protein extract before
chromatography, 3: unbound protein, 4: fraction 1, 5: fraction 8, 6:
fraction 9, 7: fraction 15, 8: 100 ng BSA, 9: 500 ng BSA, 10: 1000 ng
BSA. The position of the 60 kDa ΔVP22-Cre protein is indicated by the
arrowhead.

D: Detection of ΔVP22-Cre protein by Western blot analysis using an
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.:
2-1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein
extract before chromatography, 3: unbound protein, 4: fraction 4, 5:
fraction 8, 6: fraction 10, 7: fraction 12, 8: soluble protein extract before
chromatography, 9: unbound protein, 10: fraction 7.



Fig. 9: X-Gal staining of M5Pax8 cells treated with VP22-Cre and
ΔVP22-Cre fusion proteins. M5Pax8 fibroblasts were treated for 18 h with either
Bicine buffer (A), 0.5 µg/ml VP22-Cre (B) or 3.75 µg/ml ΔVP22-Cre (C) in
serum-free medium. Four days after treatment, cells were fixed and
stained with X-Gal.

Fig. 10: Measurement of β-galactosidase activity in cell lysates. M5Pax8
fibroblasts were treated for 18 h with VP22-Cre, ΔVP22-Cre or Bicine
buffer alone, as indicated, or transiently transfected with expression
vectors for Cre (pCMV-I-Cre-pA, see SEQ ID NO: 29) or β-galactosidase
(pCMV-I-β-pA, see SEQ ID NO: 30). Four days after treatment, cells were
lysed and the β-galactosidase activities were determined.

Fig. 11: PCR detection of Cre-mediated recombination in cells treated with
VP22-Cre and ΔVP22-Cre fusion proteins (shown in SEQ ID NOs: 21 and
14, respectively).

A: PCR analysis of genomic DNA isolated from M5Pax8 fibroblasts. Cells
were transiently transfected with a Cre expression vector (lane 2) or
treated for 18 h with either buffer alone (lane 3), 7.5 µg/ml VP22-Cre (4,
5) or 15 µg/ml ΔVP22-Cre (6, 7) in serum-free medium. Four days after
treatment, genomic DNA was extracted and subjected to PCR
amplification. Deletion of the loxP-flanked DNA segment is indicated by the
presence of the 226 bp DNA fragment. To confirm the presence of the Nco
I restriction site in the recombined allele, the PCR products were digested
with Nco I, which produces two diagnostic fragments of 85 bp and 141 bp
(lanes 5 and 7). Lane 1: 100 bp ladder (Life Technologies), lane 8: 1 kb
ladder (Life Technologies).

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA
segment. Arrows indicate the positions of the primers.

Detailed Description of the Invention

The expression "target sequences" according to the present invention
means all kinds of sequences which may be mutated (viz. deleted,
translocated, integrated and/or inverted) by the action of the
recombinase. The number of RRS in the target sequence depends on the
kind of mutation to be performed by the recombinase. For most
mutations (especially deletions and inversions) two RRS flanking the
sequence to be mutated (deleted or inverted) are required. For
some kinds of integrations only one RRS may be necessary within the
target sequence.
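
The dependence of the outcome on the number and disposition of the RRS can be sketched as a toy model. The following is a minimal illustration only, assuming a linear locus written as a list of named segments and oriented sites; the names `RRS`, `flip` and `recombine` are hypothetical and not part of the invention, and integration and translocation events are not modelled.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class RRS:
    """A recombinase recognition sequence with an orientation (+1 or -1)."""
    orientation: int


Element = Union[str, RRS]


def flip(segment: str) -> str:
    """Mark a named segment as inverted (a leading '~' toggles the mark)."""
    return segment[1:] if segment.startswith("~") else "~" + segment


def recombine(locus: List[Element]) -> List[Element]:
    """Recombine the first two RRS of a linear locus (simplified model).

    Same orientation: the flanked segment is deleted and one RRS remains.
    Opposite orientation: the flanked segment is inverted in place.
    Fewer than two RRS: nothing can be deleted or inverted.
    """
    sites = [i for i, e in enumerate(locus) if isinstance(e, RRS)]
    if len(sites) < 2:
        return list(locus)
    i, j = sites[0], sites[1]
    if locus[i].orientation == locus[j].orientation:
        return locus[: i + 1] + locus[j + 1:]
    inverted = [flip(s) for s in reversed(locus[i + 1: j])]
    return locus[: i + 1] + inverted + locus[j:]


if __name__ == "__main__":
    floxed = ["exon1", RRS(+1), "exon2", RRS(+1), "exon3"]
    print(recombine(floxed))  # exon2 is removed, a single RRS is left behind
```

In this sketch a deletion or inversion always needs two sites, which mirrors the statement above; a locus with a single RRS is returned unchanged.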

The "living organisms" according to the present invention are multi-cell 
organisms and can be vertebrates such as mammals (e.g., rodents such 
as mice or rats) or non-mammals (e.g., fish) or can be invertebrates such 
as insects or worms, or can be plants (higher plants, algae or fungi). Most
preferred living organisms are mice and fish.

"Cell culture" according to the present invention includes cells isolated from
the above defined living organism and cultured in vitro. These cells can be 
transformed (immortalized) or untransformed (directly derived from the 
living organism; primary cell culture). 

The site-specific DNA recombinase domain within the fusion protein of the
present invention is preferably selected from a
recombinase protein derived from Cre, Flp, φC31 recombinase (Thorpe and
Smith, Proc. Natl. Acad. Sci. USA, vol. 95, 5505-5510 (1998)), γδ
resolvase (Schwickardi and Droge, FEBS Letters 471: 147-150 (2000)) and
R recombinase (Araki et al., J. Mol. Biol., 182, 191-203 (1985)). The
preferred recombinases are Cre and mutants thereof (preferably the Cre
variant of aa 15 to 357 of SEQ ID NO: 2 or aa 325-567 of SEQ ID NO: 6)
and Flp and variants thereof including Flpe (preferably the Flp variant of
aa 15 to 437 of SEQ ID NO: 4 or aa 325 to 747 of SEQ ID NO: 8).

The protein transduction domain according to the present invention
includes, but is not limited to, the PTDs mentioned in the Background.
The PTD preferably is derived from the VP22 protein of HSV or
from the TAT protein of HIV. Suitable TAT proteins include, but are not
limited to, proteins comprising (i) the amino acid sequence shown in SEQ
ID NO: 10 and mutants thereof such as (ii) proteins comprising the amino
acid sequence

AGRKKRRQRRR (SEQ ID NO: 22)
YARKARRQARR (SEQ ID NO: 23)
YARAAARQARA (SEQ ID NO: 24)
YARAARRAARR (SEQ ID NO: 25)
YARAARRAARA (SEQ ID NO: 26)
YARRRRRRRRR (SEQ ID NO: 27)
YAAARRRRRRR (SEQ ID NO: 28)

as known from WO 99/29721. Preferred are transduction domains
consisting of the TAT proteins (i) and (ii) above.

Suitable VP22 proteins include, but are not limited to, the wild-type VP22
protein, i.e., a protein comprising amino acids 1 to 302 of SEQ ID NO: 21,
and truncated forms thereof. Truncated VP22 proteins in accordance with
the present invention can be those lacking 1 to 158 amino acid residues at
their N-terminal end. The most preferred VP22 protein is the truncated
VP22 PTD comprising amino acid residues 15 to 157 of SEQ ID NO: 14.

The fusion of the two domains of the fusion protein can occur at any
possible position, i.e., the protein transduction domain can be fused to the
N- or C-terminus of the site-specific DNA recombinase or can be fused to
active sites within the site-specific DNA recombinase. Preferably the
protein transduction domain is fused to the N-terminus of the site-specific
DNA recombinase domain.

The protein transduction domain can be fused to the site-specific DNA
recombinase either through a direct chemical bond or through a linker
molecule. Such a linker molecule can be any bivalent chemical structure
capable of linking the two domains. The preferred linker molecule
according to the present invention is a short peptide, e.g., having 1 to 20,
preferably 1 to 10, amino acid residues. Specifically preferred short
peptides consist essentially of Gly, Ala and/or Leu.

The fusion protein of the present invention may further
comprise other functional sequences such as secretion conferring signals, 
nuclear localisation signals and/or signals conferring protein stabilisation. 

In case the fusion protein comprises a protein transduction domain 
derived from the TAT protein of HIV, the DNA sequence coding for said 
fusion protein preferably comprises the sequence 
5' TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC 3'. 

Such a preferred DNA sequence is for instance shown in SEQ ID NO: 11. 
In said sequence the 3' terminal codon ggc codes for the linker Gly. The 
DNA sequence of a suitable recombinase may be directly attached to said 
codon ggc. 
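
The relationship between the preferred DNA sequence and the TAT peptide can be checked by translating the 33 nucleotides with the standard genetic code. The following is a minimal sketch; the function name `translate` is illustrative, and the codon table is built from the standard code in T, C, A, G order.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (5' to 3') into one-letter amino acids."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    tat_dna = "TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC"
    print(translate(tat_dna))  # YGRKKRRQRRR
```

The translated peptide is the 11 amino acid TAT sequence YGRKKRRQRRR given as SEQ ID NO: 10; appending the linker codon ggc adds a C-terminal Gly.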

The fusion protein can be obtained by the following steps: 

1. Fusion of the recombinase coding region (e.g. encoding Cre: see amino
acids 15 to 357 of SEQ ID NO: 2) with the sequence conferring protein
translocation (e.g. the sequence encoding the TAT peptide
YGRKKRRQRRR, SEQ ID NO: 10) using standard cloning protocols
(Maniatis et al., Cold Spring Harbor Laboratory, New York (1989)) or
chemical synthesis.

2. Generation of a construct for the expression of the fusion protein in
prokaryotic or eukaryotic cells, e.g. in E. coli DH5α (Hanahan, J. Mol.
Biol. 166(4): 557-80 (1983)) using the QIAexpress pQE vector (Qiagen,
Hilden).

3. Expression of the above mentioned fusion protein in prokaryotic or
eukaryotic cells, e.g. in E. coli DH5α (Hanahan, 1983).

4. Extraction and purification of the above mentioned fusion protein, e.g.
as described in Nagahara et al., Nat. Med., 4(12): 1449-52 (1998).

In an experiment it was shown that TAT-mediated delivery of active Cre
protein works with sufficient efficacy to facilitate inducible gene targeting
both in cell lines and living organisms. In this experiment a vector for the
expression of a TAT-Cre fusion protein in E. coli was constructed, and TAT-Cre
protein was expressed in E. coli and purified from bacterial lysates. To test
the activity of the TAT-Cre protein in vitro, a reporter cell line that
contains a loxP-containing reporter construct was used. This reporter,
when recombined by Cre recombinase, allows the expression of a
β-galactosidase gene. Further, a transgenic mouse strain carrying a
loxP-flanked target was used to investigate the activity of the TAT-Cre
protein in vivo.

In a second experiment it was shown that VP22-mediated delivery of
active Cre protein works with sufficient efficacy to facilitate inducible gene
targeting. In this experiment bacterial expression vectors were
constructed for the production of VP22-Cre fusion proteins in E. coli. The
activity of purified VP22-Cre proteins was tested using a reporter
fibroblast cell line containing a loxP-flanked reporter construct.

Thus, injection of the purified fusion protein of the present invention
(e.g., in an amount of 1 to 200, preferably 5 to 50 µg per g body weight)
into a living organism (e.g., a mouse) carrying a gene comprising the
RRS-flanked target sequence induces recombination of said target
sequence. To demonstrate the feasibility of the
invention, a reporter mouse strain carrying an RRS-flanked cassette was
used (Thorey et al., Mol. Cell Biol., 18(10): 6164 (1998)).
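
The per-gram dose range stated above scales linearly with body weight. The following is a minimal sketch of that arithmetic; the function name `dose_range` and the 25 g example weight are assumptions for illustration and are not taken from the source, and the result is expressed in whatever mass unit the per-gram figures use.

```python
from typing import Tuple


def dose_range(body_weight_g: float, low_per_g: float, high_per_g: float) -> Tuple[float, float]:
    """Return the (minimum, maximum) total amount of fusion protein per injection.

    The per-gram bounds are multiplied by the body weight in grams; the
    result carries the same mass unit as the per-gram figures.
    """
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    if low_per_g > high_per_g:
        raise ValueError("lower bound exceeds upper bound")
    return body_weight_g * low_per_g, body_weight_g * high_per_g


if __name__ == "__main__":
    weight = 25.0  # hypothetical mouse weight in grams
    print("broad range:    ", dose_range(weight, 1, 200))
    print("preferred range:", dose_range(weight, 5, 50))
```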

Analysis is achieved by determining the pattern of induced target gene
recombination (e.g. through PCR analysis, Southern blot analysis or X-Gal
staining on tissue sections; Maniatis et al., 1989; Gossler and Zachgo,
Joyner AL (Ed.), Oxford University Press, Oxford, New York (1993)).

The procedure's advantages over current technology are as follows: 

(i) The absence of background recombination before administration of 
the fusion protein. 

(ii) The reduction of time and resources which are necessary to combine 
the recombinase transgene and two copies of the RRS-flanked target 
gene by conventional breeding. 

In experiments the following was shown: (a) With a suitable vector for
the expression of a TAT-Cre fusion protein, a TAT-Cre fusion protein was
expressed in E. coli and purified from bacterial lysates.

(b) A reporter cell line containing a loxP-containing reporter construct was
used to test the activity of the TAT-Cre protein in vitro. This reporter,
when recombined by Cre recombinase, allows the expression of a
β-galactosidase gene.

(c) A transgenic mouse strain carrying a loxP-flanked target was used to
investigate the activity of the TAT-Cre protein in vivo.

These experiments demonstrate that TAT-mediated delivery of active Cre
protein works with sufficient efficacy to facilitate inducible gene targeting
both in cell lines and living organisms.

Furthermore, bacterial expression vectors were constructed for the
production of VP22-Cre fusion proteins in E. coli. The activity of purified
VP22-Cre proteins was tested using a reporter fibroblast cell line
containing a loxP-flanked reporter construct. These experiments
demonstrate that VP22-mediated delivery of active Cre protein works with
sufficient efficacy to facilitate inducible gene targeting.

The invention is further illustrated by the following non-limiting
examples.

Examples 

Materials and Methods 

Construction of pT7-TACS: The TAT-Cre coding region was generated by
PCR using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers
TATcre sense (5'-atg cca tgg gct acg gcc gca aga agc gcc gcc aac gcc gcc
gcg gca tgt cca att tac tga ccg tac acc-3'; SEQ ID NO: 31) and TATcre
antisense (5'-ttt cgg atc cgc cgc ata acc agt g-3'; SEQ ID NO: 32) and 10
ng pCMV-I-Cre-pA (see SEQ ID NO: 29) as template. The PCR reaction was
performed using the following cycle profile: 2' 94 °C, 4 x (30" 94 °C min,
30" 50 °C, 1' 72 °C), 12 x (30" 94 °C min, 30" 55 °C, 1' 72 °C) and 10'
72 °C. The resulting PCR fragment was digested with Nco I and BamH I,
treated with Klenow enzyme and ligated into the plasmid pBSII KS+ which
had been opened with restriction enzyme BamH I, treated with Klenow
and dephosphorylated with calf intestinal phosphatase. The resulting
plasmid pBS TAT-5'cre was verified by DNA sequencing. The plasmid
pCMV-I-Cre-pA (SEQ ID NO: 29) was digested with Age I and Sal I, which
released a 1.036 kb fragment containing the 3' part of the Cre coding
region. This fragment was ligated into the plasmid pBS TAT-5'cre which
had been opened with Age I and Sal I.

10 ng pBS-TATCre was subjected to PCR amplification using 20 pmol of
primers FPA001 (5'-tat atc tag acc atg ggc tac ggc cgc aag aag c-3'; SEQ
ID NO: 33) and FPA002 (5'-gct acc acg acc ttc gat acc atc gcc atc ttc cag
cag gcg c-3'; SEQ ID NO: 34). PCR was performed using 2.5 U Platinum
Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer Solution (Gibco BRL)
according to the manufacturer's protocol. The following cycle profile was
used: 2' 94 °C, 25 x (30" 94 °C min, 15" 54.6 °C, 2'30" 68 °C). The
amplified PCR fragment was purified using GFX columns (Amersham
Pharmacia), digested with Xba I and ligated into the plasmid pASK75
(Skerra and Arne, Gene 151: 131-135 (1994)) which had been opened
with restriction enzymes Xba I and Eco 47 III and dephosphorylated with
calf intestinal phosphatase. The resulting plasmid pASK75-TACS was
digested with restriction enzymes Nco I and Hind III, which released a 1.1
kb fragment. The fragment was subsequently ligated into the plasmid
pT7-7 (Studier and Moffatt, J. Mol. Biol. 189: 113-130 (1986)) which had been
opened with restriction enzymes Nco I and Hind III and dephosphorylated
with calf intestinal phosphatase, resulting in the plasmid pT7-TACS (SEQ ID
NO: 16).
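
The cloning steps above rely on restriction sites carried in the 5' ends of the primers. The following is a minimal sketch that locates such sites in a primer; the function name `find_sites` is illustrative, the primer string is transcribed from the text above and should be checked against SEQ ID NO: 33, and the recognition sequences listed are the standard ones for these enzymes. All six sites are palindromic, so searching one strand is sufficient.

```python
from typing import Dict

# Recognition sequences (5' to 3') of enzymes used in the constructions.
SITES: Dict[str, str] = {
    "XbaI": "TCTAGA",
    "NcoI": "CCATGG",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "NdeI": "CATATG",
    "NotI": "GCGGCCGC",
}


def find_sites(primer: str) -> Dict[str, int]:
    """Return {enzyme: 0-based position} for each site present in the primer."""
    seq = "".join(primer.split()).upper()
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


if __name__ == "__main__":
    fpa001 = "tat atc tag acc atg ggc tac ggc cgc aag aag c"
    print(find_sites(fpa001))  # {'XbaI': 4, 'NcoI': 10}
```

The Xba I site is the one used for insertion into pASK75, and the Nco I site, which contains the ATG start codon, is the one used for the transfer into pT7-7.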

Construction of pT7-VPCS: The Cre coding region was generated by PCR
using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers VP22cre
sense (5'-taa cta gcg gcc gca tgt cca att tac tga ccg tac ac-3'; SEQ ID
NO:35) and VP22cre antisense (5'-tcg agc ggc cgc cat cgc cat ctt cca gca
ggc g-3'; SEQ ID NO:36) and 10 ng pgkcre-pA (SEQ ID NO:40) as
template. The PCR reaction was performed using the following cycle
profile: 2' 94 °C, 5 x (30" 94 °C, 30" 50 °C, 2' 72 °C), 15 x (30" 94 °C,
30" 55 °C, 2' 72 °C) and 10' 72 °C. The resulting PCR fragment was
digested with Not I and ligated into the plasmid pVP22/Myc-His
(Invitrogen), which had been opened with restriction enzyme Not I and
dephosphorylated with calf intestinal phosphatase. The resulting plasmid
pVP22-cre myc/His was verified by DNA sequencing.

10 ng pVP22-cre myc/His was subjected to PCR amplification using 20
pmol of primers FPA004 (5'-tat atc tag aca tat gac ctc tcg ccg ctc cg-3';
SEQ ID NO:37) and FPA002 (SEQ ID NO:34). PCR was performed using
2,5 U Platinum Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer
Solution (Gibco BRL) according to the manufacturer's protocol. The
following cycle profile was used: 2' 94 °C, 25 x (30" 94 °C, 15" 54,5
°C, 2'30" 68 °C). The amplified PCR fragment was purified using GFX
columns (Amersham Pharmacia), digested with Xba I and ligated into the
plasmid pASK75 (Skerra and Arne, Gene 151: 131-135 (1994)) which had
been opened with restriction enzymes Xba I and Eco 47 III and
dephosphorylated with calf intestinal phosphatase. The resulting plasmid
pASK75-VPCS was digested with restriction enzymes Nde I and Hind III,
which released a 2,0 kb fragment. The fragment was subsequently ligated
into the plasmid pT7-7 (Studier and Moffatt, J. Mol. Biol. 189: 113-130
(1986)) which had been opened with restriction enzymes Nde I and Hind
III and dephosphorylated with calf intestinal phosphatase, resulting in the
plasmid pT7-VPCS (SEQ ID NO: 17).
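
The cloning steps above rely on restriction sites carried in the 5' tails of the primers. The following Python sketch is an illustration only and not part of the described method: it takes the three primer sequences quoted above (spaces removed), searches them for the recognition sequences of Not I, Xba I and Nde I, and checks that the reverse complement of the antisense primer begins with the last seven Cre codons of SEQ ID NO: 1. The function and variable names are illustrative assumptions.

```python
# Primer sequences as quoted in the text (5' -> 3', spaces removed).
PRIMERS = {
    "VP22cre sense": "taactagcggccgcatgtccaatttactgaccgtacac",
    "VP22cre antisense": "tcgagcggccgccatcgccatcttccagcaggcg",
    "FPA004": "tatatctagacatatgacctctcgccgctccg",
}

# Recognition sequences of the enzymes used in these cloning steps.
SITES = {"Not I": "gcggccgc", "Xba I": "tctaga", "Nde I": "catatg"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (a, c, g, t only)."""
    pairs = {"a": "t", "c": "g", "g": "c", "t": "a"}
    return "".join(pairs[base] for base in reversed(seq.lower()))


def sites_in(seq):
    """Return the sorted names of all listed enzymes whose site occurs in seq."""
    return sorted(name for name, site in SITES.items() if site in seq.lower())


for name, seq in PRIMERS.items():
    print(name, sites_in(seq))
```

All three recognition sequences are palindromic, so searching one strand is sufficient in this sketch.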



Construction of pCRT7-ΔVPCS: The ΔVP22-Cre coding region was
generated by PCR using Platinum Pfx DNA polymerase (Life Technologies),
20 pmol of the primers FPA007 (5'-ttc cga aga cga cga aac acc-3'; SEQ ID
NO:38) and FPA008 (5'-tat att cga agc tta tta acc acc gaa ctg cg-3'; SEQ
ID NO:39) and 30 ng pT7-VPCS (SEQ ID NO:17) as template. The PCR
reaction was performed using the following cycle profile: 2' 94 °C, 25 x
(30" 94 °C, 30" 61 °C, 2'30" 68 °C) and 7' 68 °C. The resulting 1,8 kb
PCR fragment was digested with Nco I and Sfu I and ligated into the
plasmid pCRT7/VP22-1 (Invitrogen), which had been opened with
restriction enzymes Nco I and Sfu I and dephosphorylated with calf
intestinal phosphatase. The resulting plasmid pCRT7-ΔVPCS (SEQ ID
NO: 15) was verified by DNA sequencing.

Expression of the fusion proteins in E. coli: E. coli BL21(DE3)-RIL cells
(Stratagene) were transformed with pT7-TACS and grown on LB agar
plates containing 100 µg/ml ampicillin. E. coli BL21(DE3)-RP cells
(Stratagene) were transformed with pT7-VPCS and grown on LB agar
plates containing 100 µg/ml ampicillin. E. coli BL21(DE3)-pLysS
(Invitrogen) were transformed with pCRT7-ΔVPCS and grown on LB agar
plates containing 25 µg/ml kanamycin and 34 µg/ml chloramphenicol.
Single colonies were isolated and used to prepare glycerol stocks. Eight
5 ml LB (Luria Bertani) aliquots containing antibiotics were inoculated with
stabs from the glycerol stocks and grown overnight at 37 °C with shaking.
Two 5 ml overnight cultures were each used to inoculate one of four 1 l LB
aliquots containing antibiotics and grown at 37 °C with shaking. Growth
rate was monitored by spectrophotometry at 578 nm. When the cultures
had attained an OD578 = 0,5, expression of the fusion proteins was
induced by the addition of 0,5 mM isopropyl-β-D-1-thiogalactopyranoside
(IPTG). Two hours after induction cells were harvested by centrifugation at
12000 x g and the pellet was rapidly frozen in liquid nitrogen and stored
immediately at -80 °C.

Purification of the fusion proteins from bacterial lysates: Each 10 g cell
pellet was resuspended on ice in 30 ml Bicine buffer (50 mM Bicine, pH 8,5)
including one protease inhibitor tablet (Complete, Roche). Cells were lysed
through threefold treatment (1500 psi, 5 minutes) with the cell disruption
bomb (Parr Instrument). 30 ml of Benzonase (10000 U, Merck) was added
and cell extracts were incubated for 30 minutes at 4 °C. Cell extracts were
then centrifuged at 12000 x g (4 °C). The pellet was redissolved in 8 M
urea, 50 mM Bicine, 100 mM DTT, pH 8,5 by incubation for 16 hours at
4 °C. The protein extract was centrifuged at 31000 x g and the supernatant
harvested. The protein extract was diluted in an equal volume of
Chromatography buffer A (50 mM Bicine, pH 8,5). The pH was adjusted to
pH 8,5 and the extract was filtered through a 0,45 µm filter (Millipore).
FPLC (Äkta Explorer, Amersham Pharmacia) was performed using a cation
exchange column (Sepharose SP, Column body HR 5/5 (0.5 x 5 cm),
column volume (CV) 1 ml, linear flow 300 cm/hour, Amersham Pharmacia).
After addition of sample to the FPLC column, buffer was exchanged with
Chromatography buffer A at 10 CV.

TAT-Cre and VP22-Cre fusion proteins were eluted from the column by
gradient elution using chromatography buffer B (50 mM Bicine, 1 M NaCl,
pH 8,5) using the following profile: 0 - 50 % buffer B, 0 CV; 50 % buffer
B, 10 CV; 50 - 100 % buffer B (linear gradient), 20 CV; 100 % buffer B,
10 CV. ΔVP22-Cre protein was eluted from the column by gradient elution
using the following profile: 0 - 10 % buffer B, 0 CV; 10 % buffer B, 10 CV;
10 - 30 % buffer B, 0 CV; 30 % buffer B, 10 CV; 30 - 100 % buffer B, 0
CV; 100 % buffer B, 10 CV. Three 1,5 ml fractions each containing purified
fusion proteins were collected. Purity and concentration of protein
fractions were determined by Coomassie blue stained SDS-PAGE gels and
Western blot analysis using dilutions of BSA standard solutions. In addition,
protein content was determined using a Bradford assay (Coomassie Plus
protein assay, Pierce).
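
The two elution profiles above can be written down as lists of segments (start % buffer B, end % buffer B, length in CV), where a segment of 0 CV is an immediate step. The sketch below is only an illustration of how the percentage of buffer B, and hence the NaCl concentration, follows from the profile at a given elution volume. It assumes that buffer A contains no NaCl, as stated in its composition above; the function names are hypothetical.

```python
# (start % B, end % B, length in column volumes); 0 CV means an immediate step.
TAT_VP22_PROFILE = [(0, 50, 0), (50, 50, 10), (50, 100, 20), (100, 100, 10)]
DELTA_VP22_PROFILE = [(0, 10, 0), (10, 10, 10), (10, 30, 0), (30, 30, 10),
                      (30, 100, 0), (100, 100, 10)]


def total_cv(profile):
    """Total length of an elution profile in column volumes."""
    return sum(length for _start, _end, length in profile)


def percent_b(profile, cv):
    """Percentage of buffer B after `cv` column volumes of elution."""
    if cv < 0 or cv > total_cv(profile):
        raise ValueError("cv lies outside the elution profile")
    elapsed = 0.0
    for start, end, length in profile:
        if length > 0 and cv <= elapsed + length:
            fraction = (cv - elapsed) / length
            return start + (end - start) * fraction
        elapsed += length
    return profile[-1][1]


def nacl_millimolar(profile, cv, buffer_b_nacl_mm=1000.0):
    """NaCl concentration, assuming buffer A is NaCl-free and B holds 1 M."""
    return percent_b(profile, cv) / 100.0 * buffer_b_nacl_mm


print(percent_b(TAT_VP22_PROFILE, 20), nacl_millimolar(TAT_VP22_PROFILE, 20))
```

At an exact step boundary the function returns the value before the step; this is a convention of the sketch, not a property of the instrument.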

SDS-PAGE and Western blot analysis: SDS-PAGE and Coomassie staining
were performed according to standard protocols (Maniatis et al., Cold
Spring Harbor Laboratory, New York (1989)) using 4-12 % gradient
SDS-polyacrylamide gels (NuPAGE, Invitrogen, cat. no.: NP0321).
Western blot analysis was performed using a Semi-Dry Blotting Chamber
(Biorad) and nitrocellulose membranes (0,2 µm; Schleicher & Schuell)
according to the manufacturers' protocols. The fusion proteins were
detected by using an alkaline phosphatase-conjugated anti-strep tag
antibody (IBA, Cat. No.: 2-1503-001) according to the manufacturer's
protocol.

Generation of the M5Pax8 Cre reporter cell line: The SV40-transformed
murine embryonic fibroblast line MEF5/5 (Schwenk et al., Nucl. Acids Res.
26(6), 1427-32 (1998)) was transfected with the vector pPGKpaX1
(Kellendonk et al., Nucl. Acids Res. 24, 1404-11 (1996)). 10^ MEF5/5 cells
were electroporated with 20 µg pPGKpaX1 plasmid DNA linearised with
Sca I and plated into 48-well plates. The cells were cultured in
DMEM/Glutamax medium (Life Technologies) supplemented with 10 %
fetal calf serum at 37 °C, 10 % CO2 in humid atmosphere. Two days after
transfection the medium was supplemented with 5 µg/ml puromycin
(Calbiochem) for the selection of stable integrants. 14
puromycin-resistant clones were expanded and tested by transient
transfection with the Cre expression vector pPGK-Cre-pA (SEQ ID NO: 40).
In two out of the 14 puromycin-resistant clones, the expression of
β-galactosidase could be detected by staining with X-Gal. One of these
clones, M5Pax8, was used as Cre reporter cell line.

Transfection and measurement of β-galactosidase activity: Fibroblasts
(10^ cells per 24 well plate (Falcon)) were transfected with 25 ng pCMV-I-
Cre-pA (see SEQ ID NO: 29) or pCMV-I-β-pA (see SEQ ID NO:30) plasmids
using the FuGene transfection reagent (Roche Diagnostics). After 2 days
the cells were lysed and the β-galactosidase activities were determined
with the β-galactosidase reporter gene assay (Roche Diagnostics)
according to the manufacturer's guidelines using a Lumistar luminometer
(MWG).

Histochemical detection of β-galactosidase activity: To quantitate
β-galactosidase expression, fibroblast cells were washed once with
phosphate buffered saline (PBS), and the cells were fixed for 5 minutes at
room temperature in a solution of 4 % formaldehyde in PBS. Next, the cells
were washed twice with PBS and finally incubated in staining solution for
24 hours at 37 °C (staining solution: 5 mM K3(Fe(CN)6), 5 mM
K4(Fe(CN)6), 2 mM MgCl2, 1 mg/ml X-Gal (BioMol) in PBS). Blue stained,
β-galactosidase positive cells were detected and distinguished from
negative (transparent) cells in a cell culture binocular microscope under
200x magnification. For each determination a minimum of 200 cells was
counted.
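
As an illustration of how such a count translates into a percentage of recombined cells, the following Python sketch computes the fraction of blue cells together with a Wilson score interval. The interval is an addition for illustration and is not part of the described protocol; the example count of 104 blue cells out of 200 is hypothetical.

```python
import math


def percent_positive(blue, total, z=1.96):
    """Return (percent, lower, upper) for blue cells among `total` counted.

    The bounds are a Wilson score interval (z = 1.96 for about 95 %).
    """
    if total <= 0 or not 0 <= blue <= total:
        raise ValueError("counts must satisfy 0 <= blue <= total, total > 0")
    p = blue / total
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return 100 * p, 100 * (centre - half), 100 * (centre + half)


# Hypothetical count: 104 blue cells among the minimum of 200 counted cells.
percent, lower, upper = percent_positive(104, 200)
print(f"{percent:.1f} % positive ({lower:.1f} - {upper:.1f} %)")
```

With only 200 cells counted the interval spans roughly ±7 percentage points around 50 %, which gives a sense of the resolution of the staining readout.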

PCR detection of Cre-mediated recombination: Genomic DNA extracted
from tissue samples was subjected to PCR using Taq polymerase (Gibco
BRL Cat. No. 10342-020) using 20 pmol of each primer (sense: 5' -CAT
GTG GGG GGG TTT GGA CGT G - 3', antisense: 5' -GGG ATG GGT GCG
GGG GTG TTG - 3'; SEQ ID Nos: 41 and 42, respectively). PCR was
performed using the following cycle profile: 2' 94 °C, 35 x (30" 94 °C, 30"
55 °C, 1' 72 °C), 10 min 72 °C. PCR products were separated on a 1,2 %
agarose gel.
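
The cycle profiles in this section use the shorthand ' for minutes and " for seconds. As a reading aid, the sketch below encodes the detection profile above as (repeats, steps) blocks and sums the programmed hold times. It is an illustration only; ramp times between temperatures are not included, and the data layout is an assumption of this sketch.

```python
# Each block is (number of repeats, [(hold time in seconds, temperature in °C)]).
DETECTION_PCR = [
    (1, [(120, 94)]),                       # 2' 94 °C
    (35, [(30, 94), (30, 55), (60, 72)]),   # 35 x (30" 94 °C, 30" 55 °C, 1' 72 °C)
    (1, [(600, 72)]),                       # 10 min 72 °C
]


def total_seconds(profile):
    """Sum of all programmed hold times, ignoring temperature ramps."""
    total = 0
    for repeats, steps in profile:
        total += repeats * sum(seconds for seconds, _temperature in steps)
    return total


print(total_seconds(DETECTION_PCR) / 60, "minutes")
```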

Example 1 

The vector pT7-TACS (SEQ ID NO: 16) was constructed for the expression
of a TAT-Cre fusion protein in E. coli. The plasmid contains the coding
region of the 11 amino acid protein transduction domain of the wild-type
HIV TAT protein (Green and Loewenstein, Cell, 55(6):1179-88 (1988);
Frankel and Pabo, Cell, 55(6): 1189-93 (1988); SEQ ID NO: 10) fused to
the N-terminus of Cre recombinase protein sequence. The 10-amino-acid
strep tag at the C-terminus allows the detection and purification of the
fusion protein using specific antibodies (Schmidt and Skerra, J.
Chromatogr. A 676: 337-345 (1994)). The protease factor Xa recognition
site (Ile-Glu-Gly-Arg) permits the removal of the strep tag by proteolytic
cleavage. The estimated molecular weight of the TAT-Cre fusion protein is
42 kDa. A scheme of the TAT-Cre expression vector is depicted in figure 2.
For the expression of TAT-Cre, the E. coli strain BL21(DE3)-RIL
(Stratagene) was used. This strain carries an IPTG-inducible T7
polymerase gene and additional copies of the tRNA genes for the 'rare
codons' argU, ileY and leuW.
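
The N-terminal arrangement of the fusion can be read directly from the sequence listing. The following Python sketch is an illustration only: it translates the first 48 nucleotides of SEQ ID NO: 1 with the standard genetic code and shows the arginine-rich 11-residue stretch at residues 3 to 13, followed by the start of the Cre sequence. The helper names are assumptions of this sketch.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    dna = dna.replace(" ", "").lower()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 16 codons of SEQ ID NO: 1 as given in the sequence listing.
N_TERMINUS = "atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc"
peptide = translate(N_TERMINUS)
print(peptide)        # one-letter code of residues 1 to 16
print(peptide[2:13])  # residues 3 to 13
```

The same function can be applied to the full coding sequences in the listing to compare each DNA entry with its protein entry.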

E. coli BL21(DE3)-RIL cells were transformed with pT7-TACS and grown in
LB medium containing 100 µg/ml ampicillin. The expression of the 40 kDa
TAT-Cre fusion protein could be strongly induced by the addition of 0,5
mM IPTG to the culture medium. Analysis of protein lysates revealed that
approximately 50 % of TAT-Cre protein accumulated as insoluble inclusion
bodies. The inclusion bodies were extracted and dissolved in 8 M urea.
TAT-Cre was subsequently purified from this fraction using ion exchange
chromatography. The quantity and purity of TAT-Cre protein were
determined using Coomassie stained SDS-PAGE gels and Western blot
analysis (figure 3). The purification process yielded TAT-Cre protein
extracts of 64 % purity and a concentration of 100 µg/ml.
To analyse the ability of the purified TAT-Cre protein to transduce into
cultured cells, we used the fibroblast cell line M5Pax8 (R. Kühn,
unpublished) that contains a loxP-containing reporter construct. This
reporter, when recombined by Cre recombinase, allows the expression of
a β-galactosidase gene (Buchholz et al., Nucleic Acids Res. 24, 4256-4262,
1996). Cells were cultured for 18 h with increasing concentrations of
TAT-Cre protein in serum-free medium and analysed 4 days later for
β-galactosidase activity. Staining with X-Gal showed that > 50 % of the
cells treated with 13,8 µg/ml TAT-Cre protein expressed β-galactosidase,
indicating that recombination of the loxP-flanked reporter construct had
occurred (figure 4). Measurement of β-galactosidase activity in cell
lysates revealed an up to 30-fold higher level of β-galactosidase activity
in comparison to cells which had been transiently transfected with a
eukaryotic Cre expression vector (figure 5).
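
For orientation, the mass concentrations quoted above can be converted into molar concentrations. The sketch below is an illustration only; it uses the estimated molecular weight of 42 kDa stated for TAT-Cre and does not correct for the reported 64 % purity. The function name is hypothetical.

```python
def ug_per_ml_to_nanomolar(conc_ug_per_ml, mw_kda):
    """Convert a protein concentration from µg/ml to nM.

    1 µg/ml equals 1 mg/l, and 1 kDa equals 1000 g/mol, so
    nM = (µg/ml) / kDa * 1000.
    """
    if mw_kda <= 0:
        raise ValueError("molecular weight must be positive")
    return conc_ug_per_ml / mw_kda * 1000.0


# 13,8 µg/ml TAT-Cre at an estimated 42 kDa.
print(round(ug_per_ml_to_nanomolar(13.8, 42), 1), "nM")
```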

To investigate the activity of TAT-Cre protein in a living organism, we used
a transgenic mouse strain carrying a loxP-flanked target for Cre-mediated
recombination (Thorey et al., 1998, Mol. Cell. Biol. 18: 3081 - 3088). Mice
were treated three times with intraperitoneal injections of 75 µg TAT-Cre
protein at two-day intervals and analysed 2 days later. Genomic DNA was
isolated from a variety of organs and subjected to PCR amplification which
specifically amplifies a 400 bp fragment of the recombined allele. The
deleted allele could be detected in multiple tissues from treated mice,
indicating TAT-Cre-mediated recombination in these organs (figure 6).
This experiment demonstrates that TAT-mediated delivery of active Cre
protein works with sufficient efficacy to facilitate inducible gene targeting
in cell lines and in living organisms.

Example 2 

The vectors pT7-VPCS (SEQ ID NO: 17) and pCRT7-ΔVPCS (SEQ ID NO: 15)
were constructed for the expression of VP22-Cre and ΔVP22-Cre fusion
proteins in E. coli. The VP22-Cre gene of pT7-VPCS contains the full length
protein translocation domain of the HSV VP22 protein (Elliott and O'Hare,
Cell, 88(2): 223-33 (1997)), whereas the ΔVP22-Cre gene of pCRT7-ΔVPCS
contains a truncated VP22 protein transduction domain (amino acids 159 -
301; Invitrogen; aa 16-157 of SEQ ID NO:14) fused to the N-terminus of
Cre recombinase protein sequence. A 10-amino-acid strep tag at the C-
terminus of Cre protein sequence allows the detection and purification of
the fusion proteins using specific antibodies (Schmidt and Skerra, J.
Chromatogr. A 676: 337-345 (1994)). The protease factor Xa recognition
site permits the removal of the strep tag by proteolytic cleavage. The
estimated molecular weight is 75 kDa for VP22-Cre protein and 60 kDa for
ΔVP22-Cre protein. A scheme of the vectors pT7-VPCS and pCRT7-ΔVPCS
is depicted in figure 7.

E. coli BL21(DE3)-RP cells (Stratagene) were transformed with pT7-VPCS
and cultured in LB medium containing 100 µg/ml ampicillin. E. coli
BL21(DE3)-pLysS cells (Stratagene) were transformed with pCRT7-ΔVPCS
and cultured in LB medium containing 25 µg/ml kanamycin and 34 µg/ml
chloramphenicol. Expression of the VP22-Cre and ΔVP22-Cre fusion
proteins could be induced by the addition of 0,5 mM IPTG to the culture
medium. Analysis of protein extracts using Coomassie staining and
Western blotting of SDS-PAGE gels revealed that 50 - 60 % of VP22-Cre
and ΔVP22-Cre proteins accumulated as insoluble inclusion bodies. The
inclusion bodies were extracted and dissolved in 8 M urea. VP22-Cre and
ΔVP22-Cre fusion proteins were subsequently purified using ion exchange
chromatography. The quantity and purity of the isolated VP22-Cre and
ΔVP22-Cre fusion proteins were determined using Coomassie stained
SDS-PAGE gels and Western blot analysis (figure 8).
To analyse the ability of the purified fusion proteins to transduce into
cultured cells, we used the fibroblast cell line M5Pax8 that contains a loxP-
containing reporter construct. When recombined by Cre recombinase, the
reporter allows the expression of a β-galactosidase gene (Buchholz et al.,
Nucleic Acids Res. 24, 4256-4262, 1996). The cells were cultured for 18
h with increasing concentrations of VP22-Cre and ΔVP22-Cre in serum-free
medium and analysed 4 days later for β-galactosidase activity. Staining
with X-Gal showed ^2 % blue cells in the cultures treated with up to 15
µg/ml ΔVP22-Cre, indicating that recombination of the loxP-flanked
reporter construct had occurred. In contrast, cell cultures treated with up
to 0,5 µg/ml VP22-Cre did not show any X-Gal staining (figure 9).
Measurement of cell lysates revealed a strong increase of β-galactosidase
activity upon ΔVP22-Cre treatment when compared to untreated cells
(figure 10). Genomic DNA was isolated and subjected to PCR amplification
that specifically amplifies a 250 bp fragment of the recombined allele. The
deleted allele could be detected in cells treated with both VP22-Cre and
ΔVP22-Cre fusion proteins (figure 11).

This experiment demonstrates that VP22-mediated delivery of active Cre
protein works with sufficient efficacy to facilitate inducible gene targeting.

SEQUENCE LISTING
<110> ARTEMIS Pharmaceuticals GmbH 

<120> Transduction of recombinases for inducible gene 
targeting 

<130> 010007wo/JH/na 

<140> 
<141> 

<160> 42 

<170> PatentIn Ver. 2.1



<210> 1 
<211> 1074 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<220> 

<221> CDS 

<222> (1)..(1071)

<400> 1 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc 48
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96
Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala
20 25 30

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 144
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg
35 40 45

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tcc gtt tgc cgg 192
Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg
50 55 60

tcg tgg gcg gca tgg tgc aag ttg aat aac cgg aaa tgg ttt ccc gca 240
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala
65 70 75 80

gaa cct gaa gat gtt cgc gat tat ctt cta tat ctt cag gcg cgc ggt 288
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly
85 90 95

ctg gca gta aaa act atc cag caa cat ttg ggc cag cta aac atg ctt 336
Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu
100 105 110

cat cgt cgg tcc ggg ctg cca cga cca agt gac agc aat gct gtt tca 384
His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser
115 120 125

ctg gtt atg cgg cgg atc cga aaa gaa aac gtt gat gcc ggt gaa cgt 432
Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg
130 135 140

gca aaa cag gct cta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480
Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg
145 150 155 160

tca ctc atg gaa aat agc gat cgc tgc cag gat ata cgt aat ctg gca 528
Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala
165 170 175

ttt ctg ggg att gct tat aac acc ctg tta cgt ata gcc gaa att gcc 576
Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala
180 185 190

agg atc agg gtt aaa gat atc tca cgt act gac ggt ggg aga atg tta 624
Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu
195 200 205

atc cat att ggc aga acg aaa acg ctg gtt agc acc gca ggt gta gag 672
Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu
210 215 220

aag gca ctt agc ctg ggg gta act aaa ctg gtc gag cga tgg att tcc 720
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser
225 230 235 240

gtc tct ggt gta gct gat gat ccg aat aac tac ctg ttt tgc cgg gtc 768
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val
245 250 255

aga aaa aat ggt gtt gcc gcg cca tct gcc acc agc cag cta tca act 816
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr
260 265 270

cgc gcc ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864
Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly
275 280 285

gct aag gat gac tct ggt cag aga tac ctg gcc tgg tct gga cac agt 912
Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser
290 295 300

gcc cgt gtc gga gcc gcg cga gat atg gcc cgc gct gga gtt tca ata 960
Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile
305 310 315 320

ccg gag atc atg caa gct ggt ggc tgg acc aat gta aat att gtc atg 1008
Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met
325 330 335

aac tat atc cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056
Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu
340 345 350

ctg gaa gat ggc gat tag 1074
Leu Glu Asp Gly Asp
355



<210> 2
<211> 357
<212> PRT
<213> Artificial Sequence

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<400> 2 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala
20 25 30

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg
35 40 45

Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg
50 55 60

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala
65 70 75 80

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly
85 90 95

Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu
100 105 110

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser
115 120 125

Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg
130 135 140

Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg
145 150 155 160

Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala
165 170 175

Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala
180 185 190

Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu
195 200 205

Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu
210 215 220

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser
225 230 235 240

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val
245 250 255

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr
260 265 270

Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly
275 280 285

Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser
290 295 300

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile
305 310 315 320

Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met
325 330 335

Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu
340 345 350

Leu Glu Asp Gly Asp
355



<210> 3 
<211> 1317 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<220> 

<221> CDS 

<222> (1)..(1311)

<400> 3 



atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg agt 48
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

caa ttt gat ata tta tgt aaa aca cca cct aag gtc ctg gtt cgt cag 96
Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gln
20 25 30

ttt gtg gaa agg ttt gaa aga cct tca ggg gaa aaa ata gca tca tgt 144
Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys Ile Ala Ser Cys
35 40 45

gct gct gaa cta acc tat tta tgt tgg atg att act cat aac gga aca 192
Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met Ile Thr His Asn Gly Thr
50 55 60

gca atc aag aga gcc aca ttc atg agc tat aat act atc ata agc aat 240
Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr Ile Ile Ser Asn
65 70 75 80

tcg ctg agt ttc gat att gtc aac aaa tca ctc cag ttt aaa tac aag 288
Ser Leu Ser Phe Asp Ile Val Asn Lys Ser Leu Gln Phe Lys Tyr Lys
85 90 95

acg caa aaa gca aca att ctg gaa gcc tca tta aag aaa tta att cct 336
Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser Leu Lys Lys Leu Ile Pro
100 105 110

gct tgg gaa ttt aca att att cct tac aat gga caa aaa cat caa tct 384
Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn Gly Gln Lys His Gln Ser
115 120 125

gat atc act gat att gta agt agt ttg caa tta cag ttc gaa tca tcg 432
Asp Ile Thr Asp Ile Val Ser Ser Leu Gln Leu Gln Phe Glu Ser Ser
130 135 140

gaa gaa gca gat aag gga aat agc cac agt aaa aaa atg ctt aaa gca 480
Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala
145 150 155 160

ctt cta agt gag ggt gaa agc atc tgg gag atc act gag aaa ata cta 528
Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu Ile Thr Glu Lys Ile Leu
165 170 175

aat tcg ttt gag tat acc tcg aga ttt aca aaa aca aaa act tta tac 576
Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr
180 185 190

caa ttc ctc ttc cta gct act ttc atc aat tgt gga aga ttc agc gat 624
Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn Cys Gly Arg Phe Ser Asp
195 200 205

att aag aac gtt gat ccg aaa tca ttt aaa tta gtc caa aat aag tat 672
Ile Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gln Asn Lys Tyr
210 215 220

ctg gga gta ata atc cag tgt tta gtg aca gag aca aag aca agc gtt 720
Leu Gly Val Ile Ile Gln Cys Leu Val Thr Glu Thr Lys Thr Ser Val
225 230 235 240

agt agg cac ata tac ttc ttt agc gca agg ggt agg atc gat cca ctt 768
Ser Arg His Ile Tyr Phe Phe Ser Ala Arg Gly Arg Ile Asp Pro Leu
245 250 255

gta tat ttg gat gaa ttt ttg agg aat tct gaa cca gtc cta aaa cga 816
Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg
260 265 270

gta aat agg acc ggc aat tct tca agc aac aaa cag gaa tac caa tta 864
Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gln Glu Tyr Gln Leu
275 280 285

tta aaa gat aac tta gtc aga tcg tac aac aag gct ttg aag aaa aat 912
Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn
290 295 300

gcg cct tat cca atc ttt gct ata aag aat ggc cca aaa tct cac att 960
Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn Gly Pro Lys Ser His Ile
305 310 315 320

gga aga cat ttg atg acc tca ttt ctg tca atg aag ggc cta acg gag 1008
Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu
325 330 335

ttg act aat gtt gtg gga aat tgg agc gat aag cgt gct tct gcc gtg 1056
Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val
340 345 350

gcc agg aca acg tat act cat cag ata aca gca ata cct gat cac tac 1104
Ala Arg Thr Thr Tyr Thr His Gln Ile Thr Ala Ile Pro Asp His Tyr
355 360 365

ttc gca cta gtt tct cgg tac tat gca tat gat cca ata tca aag gaa 1152
Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro Ile Ser Lys Glu
370 375 380

atg ata gca ttg aag gat gag act aat cca att gag gag tgg cag cat 1200
Met Ile Ala Leu Lys Asp Glu Thr Asn Pro Ile Glu Glu Trp Gln His
385 390 395 400

ata gaa cag cta aag ggt agt gct gaa gga agc ata cga tac ccc gca 1248
Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly Ser Ile Arg Tyr Pro Ala
405 410 415

tgg aat ggg ata ata tca cag gag gta cta gac tac ctt tca tcc tac 1296
Trp Asn Gly Ile Ile Ser Gln Glu Val Leu Asp Tyr Leu Ser Ser Tyr
420 425 430

ata aat aga cgc ata taatga 1317
Ile Asn Arg Arg Ile
435



<210> 4 
<211> 437 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<400> 4 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gln
20 25 30

Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys Ile Ala Ser Cys
35 40 45

Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met Ile Thr His Asn Gly Thr
50 55 60

Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr Ile Ile Ser Asn
65 70 75 80

Ser Leu Ser Phe Asp Ile Val Asn Lys Ser Leu Gln Phe Lys Tyr Lys
85 90 95

Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser Leu Lys Lys Leu Ile Pro
100 105 110

Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn Gly Gln Lys His Gln Ser
115 120 125

Asp Ile Thr Asp Ile Val Ser Ser Leu Gln Leu Gln Phe Glu Ser Ser
130 135 140

Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala
145 150 155 160

Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu Ile Thr Glu Lys Ile Leu
165 170 175

Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr
180 185 190

Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn Cys Gly Arg Phe Ser Asp
195 200 205

Ile Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gln Asn Lys Tyr
210 215 220

Leu Gly Val Ile Ile Gln Cys Leu Val Thr Glu Thr Lys Thr Ser Val
225 230 235 240

Ser Arg His Ile Tyr Phe Phe Ser Ala Arg Gly Arg Ile Asp Pro Leu
245 250 255

Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg
260 265 270

Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gln Glu Tyr Gln Leu
275 280 285

Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn
290 295 300

Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn Gly Pro Lys Ser His Ile
305 310 315 320

Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu
325 330 335

Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val
340 345 350

Ala Arg Thr Thr Tyr Thr His Gln Ile Thr Ala Ile Pro Asp His Tyr
355 360 365

Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro Ile Ser Lys Glu
370 375 380

Met Ile Ala Leu Lys Asp Glu Thr Asn Pro Ile Glu Glu Trp Gln His
385 390 395 400

Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly Ser Ile Arg Tyr Pro Ala
405 410 415

Trp Asn Gly Ile Ile Ser Gln Glu Val Leu Asp Tyr Leu Ser Ser Tyr
420 425 430

Ile Asn Arg Arg Ile
435



<210> 5 
<211> 2004 

<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: DNA sequence
coding for a fusion protein VP22-Cre

<220>

<221> CDS

<222> (1)..(2001)

<400> 5

atg acc tct cgc cgc tec gtg aag teg ggt ccg egg gag gtt ccg cgc 48 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
15 10 15 



gat gag tac gag gat ctg tac tac acc ccg tct tea ggt atg gcg agt ,96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 



# 
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ccc gat agt ccg cct gac acc tec cgc cgt ggc gcc eta cag aea cgc 14 4 
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

teg cgc cag agg ggc gag gtc cgt ttc gte cag tac gac gag teg gat 192 
Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 55 60 

tat gcc etc tac ggg ggc teg tct tee gaa gac gac gaa cac ccg gag 240 
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

cgc aca ccc acc acc gcc ccc cgg gcc ccc cga acc cag cgg gtg gcg 384
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

act aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

tcg gcc cag cca gaa tcc gcc gca ctc cca gac gcc ccc gcg tcg acg 480
Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 576
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

gcg gcc atg cat gcc cgg atg gcg gcg gtc cag ctc tgg gac atg tcg 672
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 720
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 864
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 960
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 1008
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
325 330 335

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
340 345 350

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 1152
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
370 375 380

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 1200
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
385 390 395 400

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 1248
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 1296
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 1344
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 1392
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1440
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1488
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1536
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1584
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1632
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1680
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1728
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1776
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1824
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1872
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1920
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1968
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat tag 2004
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp
660 665

<210> 6 
<211> 667 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 

<400> 6 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
325 330 335

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
340 345 350

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
370 375 380

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
385 390 395 400

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp
660 665



<210> 7 
<211> 2247 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<220> 

<221> CDS 

<222> (1)..(2241)

<400> 7 

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30



WO 01/49832 PCT/EP01/00060


ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

tat gcc ctc tac ggg ggc tcg tct tcc gaa gac gac gaa cac ccg gag 240
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

cgc aca ccc acc acc gcc ccc cgg gcc ccc cga acc cag cgg gtg gcg 384
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

act aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

tcg gcc cag cca gaa tcc gcc gca ctc cca gac gcc ccc gcg tcg acg 480
Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 576
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

gcg gcc atg cat gcc cgg atg gcg gcg gtc cag ctc tgg gac atg tcg 672
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 720
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 864
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 960
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

agt ggc ggc cgc atg agt caa ttt gat ata tta tgt aaa aca cca cct 1008
Ser Gly Gly Arg Met Ser Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro
325 330 335

aag gtc ctg gtt cgt cag ttt gtg gaa agg ttt gaa aga cct tca ggg 1056
Lys Val Leu Val Arg Gln Phe Val Glu Arg Phe Glu Arg Pro Ser Gly
340 345 350

gaa aaa ata gca tca tgt gct gct gaa cta acc tat tta tgt tgg atg 1104
Glu Lys Ile Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met
355 360 365

att act cat aac gga aca gca atc aag aga gcc aca ttc atg agc tat 1152
Ile Thr His Asn Gly Thr Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr
370 375 380

aat act atc ata agc aat tcg ctg agt ttc gat att gtc aac aaa tca 1200
Asn Thr Ile Ile Ser Asn Ser Leu Ser Phe Asp Ile Val Asn Lys Ser
385 390 395 400

ctc cag ttt aaa tac aag acg caa aaa gca aca att ctg gaa gcc tca 1248
Leu Gln Phe Lys Tyr Lys Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser
405 410 415

tta aag aaa tta att cct gct tgg gaa ttt aca att att cct tac aat 1296
Leu Lys Lys Leu Ile Pro Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn
420 425 430

gga caa aaa cat caa tct gat atc act gat att gta agt agt ttg caa 1344
Gly Gln Lys His Gln Ser Asp Ile Thr Asp Ile Val Ser Ser Leu Gln
435 440 445

tta cag ttc gaa tca tcg gaa gaa gca gat aag gga aat agc cac agt 1392
Leu Gln Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser
450 455 460

aaa aaa atg ctt aaa gca ctt cta agt gag ggt gaa agc atc tgg gag 1440
Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu
465 470 475 480

atc act gag aaa ata cta aat tcg ttt gag tat acc tcg aga ttt aca 1488
Ile Thr Glu Lys Ile Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr
485 490 495

aaa aca aaa act tta tac caa ttc ctc ttc cta gct act ttc atc aat 1536
Lys Thr Lys Thr Leu Tyr Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn
500 505 510

tgt gga aga ttc agc gat att aag aac gtt gat ccg aaa tca ttt aaa 1584
Cys Gly Arg Phe Ser Asp Ile Lys Asn Val Asp Pro Lys Ser Phe Lys
515 520 525

tta gtc caa aat aag tat ctg gga gta ata atc cag tgt tta gtg aca 1632
Leu Val Gln Asn Lys Tyr Leu Gly Val Ile Ile Gln Cys Leu Val Thr
530 535 540

gag aca aag aca agc gtt agt agg cac ata tac ttc ttt agc gca agg 1680
Glu Thr Lys Thr Ser Val Ser Arg His Ile Tyr Phe Phe Ser Ala Arg
545 550 555 560

ggt agg atc gat cca ctt gta tat ttg gat gaa ttt ttg agg aat tct 1728
Gly Arg Ile Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser
565 570 575

gaa cca gtc cta aaa cga gta aat agg acc ggc aat tct tca agc aac 1776
Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn
580 585 590

aaa cag gaa tac caa tta tta aaa gat aac tta gtc aga tcg tac aac 1824
Lys Gln Glu Tyr Gln Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn
595 600 605

aag gct ttg aag aaa aat gcg cct tat cca atc ttt gct ata aag aat 1872
Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn
610 615 620

ggc cca aaa tct cac att gga aga cat ttg atg acc tca ttt ctg tca 1920
Gly Pro Lys Ser His Ile Gly Arg His Leu Met Thr Ser Phe Leu Ser
625 630 635 640

atg aag ggc cta acg gag ttg act aat gtt gtg gga aat tgg agc gat 1968
Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp
645 650 655

aag cgt gct tct gcc gtg gcc agg aca acg tat act cat cag ata aca 2016
Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gln Ile Thr
660 665 670

gca ata cct gat cac tac ttc gca cta gtt tct cgg tac tat gca tat 2064
Ala Ile Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr
675 680 685

gat cca ata tca aag gaa atg ata gca ttg aag gat gag act aat cca 2112
Asp Pro Ile Ser Lys Glu Met Ile Ala Leu Lys Asp Glu Thr Asn Pro
690 695 700

att gag gag tgg cag cat ata gaa cag cta aag ggt agt gct gaa gga 2160
Ile Glu Glu Trp Gln His Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly
705 710 715 720

agc ata cga tac ccc gca tgg aat ggg ata ata tca cag gag gta cta 2208
Ser Ile Arg Tyr Pro Ala Trp Asn Gly Ile Ile Ser Gln Glu Val Leu
725 730 735

gac tac ctt tca tcc tac ata aat aga cgc ata taatga 2247
Asp Tyr Leu Ser Ser Tyr Ile Asn Arg Arg Ile
740 745



<210> 8 
<211> 747 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<400> 8 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30



Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

Ser Gly Gly Arg Met Ser Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro
325 330 335
Lys Val Leu Val Arg Gln Phe Val Glu Arg Phe Glu Arg Pro Ser Gly
340 345 350

Glu Lys Ile Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met
355 360 365

Ile Thr His Asn Gly Thr Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr
370 375 380

Asn Thr Ile Ile Ser Asn Ser Leu Ser Phe Asp Ile Val Asn Lys Ser
385 390 395 400

Leu Gln Phe Lys Tyr Lys Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser
405 410 415

Leu Lys Lys Leu Ile Pro Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn
420 425 430

Gly Gln Lys His Gln Ser Asp Ile Thr Asp Ile Val Ser Ser Leu Gln
435 440 445

Leu Gln Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser
450 455 460

Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu
465 470 475 480

Ile Thr Glu Lys Ile Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr
485 490 495

Lys Thr Lys Thr Leu Tyr Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn
500 505 510

Cys Gly Arg Phe Ser Asp Ile Lys Asn Val Asp Pro Lys Ser Phe Lys
515 520 525

Leu Val Gln Asn Lys Tyr Leu Gly Val Ile Ile Gln Cys Leu Val Thr
530 535 540

Glu Thr Lys Thr Ser Val Ser Arg His Ile Tyr Phe Phe Ser Ala Arg
545 550 555 560

Gly Arg Ile Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser
565 570 575

Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn
580 585 590

Lys Gln Glu Tyr Gln Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn
595 600 605

Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn
610 615 620

Gly Pro Lys Ser His Ile Gly Arg His Leu Met Thr Ser Phe Leu Ser
625 630 635 640

Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp
645 650 655

Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gln Ile Thr
660 665 670

Ala Ile Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr
675 680 685

Asp Pro Ile Ser Lys Glu Met Ile Ala Leu Lys Asp Glu Thr Asn Pro
690 695 700

Ile Glu Glu Trp Gln His Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly
705 710 715 720

Ser Ile Arg Tyr Pro Ala Trp Asn Gly Ile Ile Ser Gln Glu Val Leu
725 730 735

Asp Tyr Leu Ser Ser Tyr Ile Asn Arg Arg Ile
740 745



<210> 9 
<211> 33 
<212> DNA

<213> Human immunodeficiency virus 
<400> 9

tacggccgca agaagcgccg ccaacgccgc cgc 33 



<210> 10 
<211> 11
<212> PRT 

<213> Human immunodeficiency virus 
<400> 10 

Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10 



<210> 11 
<211> 42 
<212> DNA 

<213> Human immunodeficiency virus 

<220> 
<221> CDS 
<222> (4)..(42)

<400> 11 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc 42 
    Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly
1 5 10 
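
The correspondence between the nucleotide entries and the peptide entries above (SEQ ID NO: 9 and 10, SEQ ID NO: 11 and 12) can be checked mechanically. The following is a minimal sketch, not part of the original listing, that translates a DNA string with the standard genetic code starting at a given 1-based CDS position. The function name `translate` and the parameter `cds_start` are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, cds_start=1):
    """Translate dna from the 1-based position cds_start (hypothetical helper).

    Whitespace is ignored, translation stops at the first stop codon,
    and an incomplete trailing codon is dropped.
    """
    seq = "".join(dna.lower().split())[cds_start - 1:]
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# SEQ ID NO: 9 (33 nt), read from position 1
seq9 = "tacggccgca agaagcgccg ccaacgccgc cgc"
# SEQ ID NO: 11 (42 nt), CDS annotated as (4)..(42)
seq11 = "atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc"

print(translate(seq9))                 # YGRKKRRQRRR
print(translate(seq11, cds_start=4))   # GYGRKKRRQRRRG
```

In one-letter code, YGRKKRRQRRR corresponds to the eleven residues of SEQ ID NO: 10, and GYGRKKRRQRRRG to the thirteen residues of SEQ ID NO: 12.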



<210> 12 
<211> 13 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 12 

Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly
1 5 10



<210> 13 
<211> 1623 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: DNA sequence
coding for a fusion protein deltaVP22cre-StrepTag

<220> 

<221> CDS 

<222> (1)..(1617)

<400> 13 

atg gct agc atg act ggt gga cag caa atg ggt cgg gat ccg tcg acg 48
Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Asp Pro Ser Thr
1 5 10 15

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 96
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
20 25 30

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 144
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
35 40 45

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 192
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
50 55 60

gcg gcc atg cat gcc cgg atg gcg gct gtc cag ctc tgg gac atg tcg 240
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
65 70 75 80

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 288
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
85 90 95

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 336
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
100 105 110

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 384
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
115 120 125

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 432
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
130 135 140

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 480
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
145 150 155 160

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 528
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
165 170 175

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 576
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
180 185 190

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 624
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
195 200 205

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 672
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
210 215 220

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 720
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
225 230 235 240

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 768
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
245 250 255

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 816
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
260 265 270

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 864
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
275 280 285

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 912
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
290 295 300

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 960
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
305 310 315 320

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1008
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
325 330 335

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1056
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
340 345 350

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1104
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
355 360 365

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1152
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
370 375 380

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1200
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
385 390 395 400

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1248
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
405 410 415

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1296
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
420 425 430

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1344
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
435 440 445

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1392
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
450 455 460

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1440
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
465 470 475 480

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1488
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
485 490 495

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1536
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
500 505 510

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt atc gaa ggt cgt 1584
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
515 520 525

ggt agc gct tgg cgt cac ccg cag ttc ggt ggt taataa 1623
Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
530 535 
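
The CDS ranges and protein lengths stated in the listing can be cross-checked arithmetically: a coding region spanning positions start..end contains (end - start + 1) / 3 codons. The sketch below is an illustration added for the reader, not part of the original listing. The helper name `cds_codon_count` is hypothetical, and the table simply restates the `<222>` ranges of SEQ ID NO: 7, 11 and 13 next to the `<211>` lengths of the matching protein entries (SEQ ID NO: 8, 12 and 14).

```python
def cds_codon_count(start, end):
    """Number of codons in a CDS spanning 1-based positions start..end.

    Hypothetical helper: raises ValueError when the span is not a
    whole number of codons.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS span of {length} nt is not a multiple of 3")
    return length // 3


# (CDS start, CDS end, protein length stated for the paired PRT entry)
listing = {
    "SEQ ID NO: 7 / 8 (VP22-Flpe)": (1, 2241, 747),
    "SEQ ID NO: 11 / 12 (Tat peptide)": (4, 42, 13),
    "SEQ ID NO: 13 / 14 (deltaVP22cre-StrepTag)": (1, 1617, 539),
}

for name, (start, end, stated) in listing.items():
    codons = cds_codon_count(start, end)
    status = "consistent" if codons == stated else "MISMATCH"
    print(f"{name}: {codons} codons, {stated} residues stated -> {status}")
```

In each case the CDS range ends before the terminal stop codon(s), so the codon count equals the residue count of the protein entry.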

<210> 14 
<211> 539 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein deltaVP22cre-StrepTag

<400> 14 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Asp Pro Ser Thr
1 5 10 15

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
20 25 30

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
35 40 45

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
50 55 60

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
65 70 75 80

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
85 90 95

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
100 105 110

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
115 120 125

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
130 135 140

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
145 150 155 160

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
165 170 175

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
180 185 190

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
195 200 205

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
210 215 220

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
225 230 235 240

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
245 250 255

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
260 265 270

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
275 280 285

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
290 295 300

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
305 310 315 320

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
325 330 335

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
340 345 350

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
355 360 365

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
370 375 380

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
385 390 395 400

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
405 410 415

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
420 425 430

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
435 440 445

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
450 455 460

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
465 470 475 480

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
485 490 495

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
500 505 510

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
515 520 525

Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
530 535

<210> 15
<211> 5953
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCRT7-deltaVPCS 

<400> 15 

cgatggattt ccgtctctgg tgtagctgat gatccgaata actacctgtt ttgccgggtc 60 
agaaaaaatg gtgttgccgc gccatctgcc accagccagc tatcaactcg cgccctggaa 120 
gggatttttg aagcaactca tcgattgatt tacggcgcta aggatgactc tggtcagaga 180 
tacctggcct ggtctggaca cagtgcccgt gtcggagccg cgcgagatat ggcccgcgct 240 
ggagtttcaa taccggagat catgcaagct ggtggctgga ccaatgtaaa tattgtcatg 300
aactatatcc gtaacctgga tagtgaaaca ggggcaatgg tgcgcctgct ggaagatggc 360 
gatggtatcg aaggtcgtgg tagcgcttgg cgtcacccgc agttcggtgg ttaataagct 420 
tcgaacaaaa actcatctca gaagaggatc tgaatatgca taccggtcat catcaccatc 480 
accattgagt tttgagcaat aactagcata accccttggg gcctctaaac gggtcttgag 540 
gggttttttg ctgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca 600 
tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa 660 
agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg 720 
ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc 780 
ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga 840 
tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact 900
gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaattaattc 960 
ttagaaaaac tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat 1020 
accatatttt tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca 1080 
taggatggca agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc 1140
tattaatttc ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac 1200 
tgaatccggt gagaatggca aaagcttatg catttctttc cagacttgtt caacaggcca 1260 
gccattacgc tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg 1320 
cgcctgagcg agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga 1380 
atgcaaccgg cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata 1440 
ttcttctaat acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc 1500 
atcaggagta cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt 1560 
tagtctgacc atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa 1620 
caactctggc gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac 1680 
attatcgcga gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg 1740 
cctcgagcaa gacgtttccc gttgaatatg gctcataaca ccccttgtat tactgtttat 1800 
gtaagcagac agttttattg ttcatgacca aaatccctta acgtgagttt tcgttccact 1860 
gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg 1920 
taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc 1980 
aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata 2040
ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta 2100 
catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc 2160
ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg 2220 
ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac 2280 
agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg 2340 
taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt 2400 
atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct 2460
cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg 2520 
ccttttgctg gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata 2580 
accgtattac cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca 2640 
gcgagtcagt gagcgaggaa gcggaagagc gcctgatgcg gtattttctc cttacgcatc 2700 
tgtgcggtat ttcacaccgc atatatggtg cactctcagt acaatctgct ctgatgccgc 2760
atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc tgcgccccga 2820 
cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 2880 
agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 2940 
aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg 3000 
tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa tgtctggctt 3060
ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga tgcctccgtg 3120
taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag gatgctcacg 3180 
atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg taaacaactg 3240 
gcggtatgga tgcggcggga ccagagaaaa atcactcagg gtcaatgcca gcgcttcgtt 3300
aatacagatg taggtgttcc acagggtagc cagcagcatc ctgcgatgca gatccggaac 3360 
ataatggtgc agggcgctga cttccgcgtt tccagacttt acgaaacacg gaaaccgaag 3420 
accattcatg ttgttgctca ggtcgcagac gttttgcagc agcagtcgct tcacgttcgc 3480
tcgcgtatcg gtgattcatt ctgctaacca gtaaggcaac cccgccagcc tagccgggtc 3540 
ctcaacgaca ggagcacgat catgcgcacc cgtggccagg acccaacgct gcccgagatg 3600 
cgccgcgtgc ggctgctgga gatggcggac gcgatggata tgttctgcca agggttggtt 3660 
tgcgcattca cagttctccg caagaattga ttggctccaa ttcttggagt ggtgaatccg 3720 
ttagcgaggt gccgccggct tccattcagg tcgaggtggc ccggctccat gcaccgcgac 3780 
gcaacgcggg gaggcagaca aggtataggg cggcgcctac aatccatgcc aacccgttcc 3840 
atgtgctcgc cgaggcggca taaatcgccg tgacgatcag cggtccagtg atcgaagtta 3900 
ggctggtaag agccgcgagc gatccttgaa gctgtccctg atggtcgtca tctacctgcc 3960 
tggacagcat ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga agaatcataa 4020 
tggggaaggc catccagcct cgcgtcgcga acgccagcaa gacgtagccc agcgcgtcgg 4080 
ccgccatgcc ggcgataatg gcctgcttct cgccgaaacg tttggtggcg ggaccagtga 4140 
cgaaggcttg agcgagggcg tgcaagattc cgaataccgc aagcgacagg ccgatcatcg 4200 
tcgcgctcca gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc ggcacctgtc 4260 
ctacgagttg catgataaag aagacagtca taagtgcggc gacgatagtc atgccccgcg 4320 
cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga cgctctccct 4380 
tatgcgactc ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg 4440 
ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg 4500 
ccaccatacc cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc 4560 
catcggtgat gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg 4620 
ccacgatgcg tccggcgtag aggatcgaga tctcgatccc gcgaaattaa tacgactcac 4680 
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 4740 
tatacatatg gctagcatga ctggtggaca gcaaatgggt cgggatccgt cgacggcgcc 4800 
aacccgatcc aagacacccg cgcaggggct ggccagaaag ctgcacttta gcaccgcccc 4860 
cccaaacccc gacgcgccat ggaccccccg ggtggccggc tttaacaagc gcgtcttctg 4920 
cgccgcggtc gggcgcctgg cggccatgca tgcccggatg gcggctgtcc agctctggga 4980 
catgtcgcgt ccgcgcacag acgaagacct caacgaactc cttggcatca ccaccatccg 5040 
cgtgacggtc tgcgagggca aaaacctgct tcagcgcgcc aacgagttgg tgaatccaga 5100 
cgtggtgcag gacgtcgacg cggccacggc gactcgaggg cgttctgcgg cgtcgcgccc 5160 
caccgagcga cctcgagccc cagcccgctc cgcttctcgc cccagacggc ccgtcgaggg 5220 
taccgagctc ggatccacta gtccagtgtg gtggaattct gcagatatcc agcacagtgg 5280 
cggccgcatg tccaatttac tgaccgtaca ccaaaatttg cctgcattac cggtcgatgc 5340 
aacgagtgat gaggttcgca agaacctgat ggacatgttc agggatcgcc aggcgttttc 5400 
tgagcatacc tggaaaatgc ttctgtccgt ttgccggtcg tgggcggcat ggtgcaagtt 5460 
gaataaccgg aaatggtttc ccgcagaacc tgaagatgtt cgcgattatc ttctatatct 5520 
tcaggcgcgc ggtctggcag taaaaactat ccagcaacat ttgggccagc taaacatgct 5580 
tcatcgtcgg tccgggctgc cacgaccaag tgacagcaat gctgtttcac tggttatgcg 5640 
gcggatccga aaagaaaacg ttgatgccgg tgaacgtgca aaacaggctc tagcgttcga 5700 
acgcactgat ttcgaccagg ttcgttcact catggaaaat agcgatcgct gccaggatat 5760 
acgtaatctg gcatttctgg ggattgctta taacaccctg ttacgtatag ccgaaattgc 5820 
caggatcagg gttaaagata tctcacgtac tgacggtggg agaatgttaa tccatattgg 5880 
cagaacgaaa acgctggtta gcaccgcagg tgtagagaag gcacttagcc tgggggtaac 5940 
taaactggtc gag 5953 



<210> 16 
<211> 4727 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-TACS 

<400> 16 

atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60 
ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 120 
tgttagcagc cggatctcag tggtggtggt ggtggtgctc gagtgcggcc gcaagcttat 180 
taaccaccga actgcgggtg acgccaagcg ctaccacgac cttcgatacc atcgccatct 240 
tccagcaggc gcaccattgc ccctgtttca ctatccaggt tacggatata gttcatgaca 300 
atatttacat tggtccagcc accagcttgc atgatctccg gtattgaaac tccagcgcgg 360 
gccatatctc gcgcggctcc gacacgggca ctgtgtccag accaggccag gtatctctga 420 
ccagagtcat ccttagcgcc gtaaatcaat cgatgagttg cttcaaaaat cccttccagg 480 
gcgcgagttg atagctggct ggtggcagat ggcgcggcaa caccattttt tctgacccgg 540 
caaaacaggt agttattcgg atcatcagct acaccagaga cggaaatcca tcgctcgacc 600 
agtttagtta cccccaggct aagtgccttc tctacacctg cggtgctaac cagcgttttc 660 
gttctgccaa tatggattaa cattctccca ccgtcagtac gtgagatatc tttaaccctg 720 
atcctggcaa tttcggctat acgtaacagg gtgttataag caatccccag aaatgccaga 780 
ttacgtatat cctggcagcg atcgctattt tccatgagtg aacgaacctg gtcgaaatca 840 
gtgcgttcga acgctagagc ctgttttgca cgttcaccgg catcaacgtt ttcttttcgg 900 
atccgccgca taaccagtga aacagcattg ctgtcacttg gtcgtggcag cccggaccga 960 
cgatgaagca tgtttagctg gcccaaatgt tgctggatag tttttactgt cagaccgcgc 1020 
gcctgaagat atagaagata atcgcgaaca tcttcaggtt ctgcgggaaa ccatttccgg 1080 
ttattcaact tgcaccatgc cgcccacgac cggcaaacgg acagaagcat tttccaggta 1140 
tgctcagaaa acgcctggcg atccctgaac atgtccatca ggttcttgcg aacctcatca 1200 
ctcgttgcat cgaccggtaa tgcaggcaaa ttttggtgta cggtcagtaa attggacatg 1260 
ccgcggcggc gttggcggcg cttcttgcgg ccgtagccca tggtatatct ccttcttaaa 1320 
gttaaacaaa attatttcta gagggaaacc gttgtggtct ccctatagtg agtcgtatta 1380 
atttcgcggg atcgagatct cgggcagcgt tgggtcctgg ccacgggtgc gcatgatcgt 1440 
gctcctgtcg ttgaggaccc ggctaggctg gcggggttgc cttactggtt agcagaatga 1500 
atcaccgata cgcgagcgaa cgtgaagcga ctgctgctgc aaaacgtctg cgacctgagc 1560 
aacaacatga atggtcttcg gtttccgtgt ttcgtaaagt ctggaaacgc ggaagtcagc 1620 
gccctgcacc attatgttcc ggatctgcat cgcaggatgc tgctggctac cctgtggaac 1680 
acctacatct gtattaacga agcgctggca ttgaccctga gtgatttttc tctggtcccg 1740 
ccgcatccat accgccagtt gtttaccctc acaacgttcc agtaaccggg catgttcatc 1800 
atcagtaacc cgtatcgtga gcatcctctc tcgtttcatc ggtatcatta cccccatgaa 1860 
cagaaatccc ccttacacgg aggcatcagt gaccaaacag gaaaaaaccg cccttaacat 1920 
ggcccgcttt atcagaagcc agacattaac gcttctggag aaactcaacg agctggacgc 1980 
ggatgaacag gcagacatct gtgaatcgct tcacgaccac gctgatgagc tttaccgcag 2040 
ctgcctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 2100 
ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc 2160 
gggtgttggc gggtgtcggg gcgcagccat gacccagtca cgtagcgata gcggagtgta 2220 
tactggctta actatgcggc atcagagcag attgtactga gagtgcacca tatatgcggt 2280 
gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgctct tccgcttcct 2340 
cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 2400 
aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 2460 
aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 2520 
tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 2580 
caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 2640 
cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 2700 
ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 2760 
gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 2820 
agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 2880 
gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 2940 
acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 3000 
gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 3060 
gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 3120 
cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 3180 
caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 3240 
gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 3300 
cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 3360 
cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 3420 
caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 3480 
gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 3540 
gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctgcaggc atcgtggtgt 3600 
cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 3660 
catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 3720 
gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 3780 
ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 3840 
gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 3900 
cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 3960 
tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 4020 
gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 4080 
atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 4140 
ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 4200 
gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 4260 
aaattgtaaa cgttaatatt ttgttaaaat tcgcgttaaa tttttgttaa atcagctcat 4320 
tttttaacca ataggccgaa atcggcaaaa tcccttataa atcaaaagaa tagaccgaga 4380 
tagggttgag tgttgttcca gtttggaaca agagtccact attaaagaac gtggactcca 4440 
acgtcaaagg gcgaaaaacc gtctatcagg gcgatggccc actacgtgaa ccatcaccct 4500 
aatcaagttt tttggggtcg aggtgccgta aagcactaaa tcggaaccct aaagggagcc 4560 
cccgatttag agcttgacgg ggaaagccgg cgaacgtggc gagaaaggaa gggaagaaag 4620 
cgaaaggagc gggcgctagg gcgctggcaa gtgtagcggt cacgctgcgc gtaaccacca 4680 
cacccgccgc gcttaatgcg ccgctacagg gcgcgtccca ttcgcca 4727 



<210> 17 
<211> 4488 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-VPCS 

<400> 17 

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 60 
gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 120 
gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 180 
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 240 
gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 300 
gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 360 
ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 420 
tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 480 
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 540 
cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 600 
accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca 660 
cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 720 
tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 780 
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 840 
acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 900 
atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 960 
tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 1020 
aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg 1080 
cgtatcacga ggccctttcg tcttcaagaa ttaaaaggat ctaggtgaag atcctttttg 1140 
ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 1200 
tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 1260 
aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 1320 
tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 1380 
agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 1440 
taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 1500 
caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 1560 
agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 1620 
aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 1680 
gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 1740 
tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 1800 
gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 1860 
ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 1920 
ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 1980 
aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 2040 
accgcatcag atctgatggt gcactctcag tacaatctgc tctgatgccg catagttaag 2100 
ccagtatata cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc 2160 
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 2220 
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 2280 
cgaggcccag cgattcgaac ttctgataga cttcgaaatt aatacgactc actataggga 2340 
gaccacaacg gtttccctct agaaataatt ttgtttaact ttaagaagga gatatacata 2400 
tgacctctcg ccgctccgtg aagtcgggtc cgcgggaggt tccgcgcgat gagtacgagg 2460 
atctgtacta caccccgtct tcaggtatgg cgagtcccga tagtccgcct gacacctccc 2520 
gccgtggcgc cctacagaca cgctcgcgcc agaggggcga ggtccgtttc gtccagtacg 2580 
acgagtcgga ttatgccctc tacgggggct cgtcttccga agacgacgaa cacccggagg 2640 
tcccccggac gcggcgtccc gtttccgggg cggttttgtc cggcccgggg cctgcgcggg 2700 
cgcctccgcc acccgctggg tccggagggg ccggacgcac acccaccacc gccccccggg 2760 
ccccccgaac ccagcgggtg gcgtctaagg cccccgcggc cccggcggcg gagaccaccc 2820 
gcggcaggaa atcggcccag ccagaatccg ccgcactccc agacgccccc gcgtcgacgg 2880 
cgccaacccg atccaagaca cccgcgcagg ggctggccag aaagctgcac tttagcaccg 2940 
cccccccaaa ccccgacgcg ccatggaccc cccgggtggc cggctttaac aagcgcgtct 3000 
tctgcgccgc ggtcgggcgc ctggcggcca tgcatgcccg gatggcggct gtccagctct 3060 
gggacatgtc gcgtccgcgc acagacgaag acctcaacga actccttggc atcaccacca 3120 
tccgcgtgac ggtctgcgag ggcaaaaacc tgcttcagcg cgccaacgag ttggtgaatc 3180 
cagacgtggt gcaggacgtc gacgcggcca cggcgactcg agggcgttct gcggcgtcgc 3240 
gccccaccga gcgacctcga gccccagccc gctccgcttc tcgccccaga cggcccgtcg 3300 
agggtaccga gctcggatcc actagtccag tgtggtggaa ttctgcagat atccagcaca 3360 
gtggcggccg catgtccaat ttactgaccg tacaccaaaa tttgcctgca ttaccggtcg 3420 
atgcaacgag tgatgaggtt cgcaagaacc tgatggacat gttcagggat cgccaggcgt 3480 
tttctgagca tacctggaaa atgcttctgt ccgtttgccg gtcgtgggcg gcatggtgca 3540 
agttgaataa ccggaaatgg tttcccgcag aacctgaaga tgttcgcgat tatcttctat 3600 
atcttcaggc gcgcggtctg gcagtaaaaa ctatccagca acatttgggc cagctaaaca 3660 
tgcttcatcg tcggtccggg ctgccacgac caagtgacag caatgctgtt tcactggtta 3720 
tgcggcggat ccgaaaagaa aacgttgatg ccggtgaacg tgcaaaacag gctctagcgt 3780 
tcgaacgcac tgatttcgac caggttcgtt cactcatgga aaatagcgat cgctgccagg 3840 
atatacgtaa tctggcattt ctggggattg cttataacac cctgttacgt atagccgaaa 3900 
ttgccaggat cagggttaaa gatatctcac gtactgacgg tgggagaatg ttaatccata 3960 
ttggcagaac gaaaacgctg gttagcaccg caggtgtaga gaaggcactt agcctggggg 4020 
taactaaact ggtcgagcga tggatttccg tctctggtgt agctgatgat ccgaataact 4080 
acctgttttg ccgggtcaga aaaaatggtg ttgccgcgcc atctgccacc agccagctat 4140 
caactcgcgc cctggaaggg atttttgaag caactcatcg attgatttac ggcgctaagg 4200 
atgactctgg tcagagatac ctggcctggt ctggacacag tgcccgtgtc ggagccgcgc 4260 
gagatatggc ccgcgctgga gtttcaatac cggagatcat gcaagctggt ggctggacca 4320 
atgtaaatat tgtcatgaac tatatccgta acctggatag tgaaacaggg gcaatggtgc 4380 
gcctgctgga agatggcgat ggtatcgaag gtcgtggtag cgcttggcgt cacccgcagt 4440 
tcggtggtta ataagcttat cgatgataag ctgtcaaaca tgagaatt 4488 



<210> 18 
<211> 1125 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TATcreStrepTag 



<220> 
<221> CDS 
<222> (1)..(1119) 



<400> 18 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc 48 
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser 
1 5 10 15 

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96 
Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 144 
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tcc gtt tgc cgg 192 
Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

tcg tgg gcg gca tgg tgc aag ttg aat aac cgg aaa tgg ttt ccc gca 240 
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

gaa cct gaa gat gtt cgc gat tat ctt cta tat ctt cag gcg cgc ggt 288 
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly 
85 90 95 

ctg aca gta aaa act atc cag caa cat ttg ggc cag cta aac atg ctt 336 
Leu Thr Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu 
100 105 110 

cat cgt cgg tcc ggg ctg cca cga cca agt gac agc aat gct gtt tca 384 
His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

ctg gtt atg cgg cgg atc cga aaa gaa aac gtt gat gcc ggt gaa cgt 432 
Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

gca aaa cag gct cta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480 
Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg 
145 150 155 160 

tca ctc atg gaa aat agc gat cgc tgc cag gat ata cgt aat ctg gca 528 
Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala 
165 170 175 

ttt ctg ggg att gct tat aac acc ctg tta cgt ata gcc gaa att gcc 576 
Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala 
180 185 190 

agg atc agg gtt aaa gat atc tca cgt act gac ggt ggg aga atg tta 624 
Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

atc cat att ggc aga acg aaa acg ctg gtt agc acc gca ggt gta gag 672 
Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

aag gca ctt agc ctg ggg gta act aaa ctg gtc gag cga tgg att tcc 720 
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser 
225 230 235 240 

gtc tct ggt gta gct gat gat ccg aat aac tac ctg ttt tgc cgg gtc 768 
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

aga aaa aat ggt gtt gcc gcg cca tct gcc acc agc cag cta tca act 816 
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr 
260 265 270 

cgc gcc ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864 
Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly 
275 280 285 

gct aag gat gac tct ggt cag aga tac ctg gcc tgg tct gga cac agt 912 
Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

gcc cgt gtc gga gcc gcg cga gat atg gcc cgc gct gga gtt tca ata 960 
Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile 
305 310 315 320 



WO 01/49832 PCT/EP01/00060 

ccg gag atc atg caa gct ggt ggc tgg acc aat gta aat att gtc atg 1008 
Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met 
325 330 335 

aac tat atc cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056 
Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

ctg gaa gat ggc gat ggt atc gaa ggt cgt ggt agc gct tgg cgt cac 1104 
Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg Gly Ser Ala Trp Arg His 
355 360 365 

ccg cag ttc ggt ggt taataa 1125 
Pro Gln Phe Gly Gly 
370 

<210> 19 
<211> 373 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TATcreStrepTag 

<400> 19 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser 
1 5 10 15 

Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly 
85 90 95 

Leu Thr Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu 
100 105 110 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg 
145 150 155 160 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala 
165 170 175 

Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala 
180 185 190 

Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser 
225 230 235 240 

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr 
260 265 270 

Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly 
275 280 285 

Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile 
305 310 315 320 

Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met 
325 330 335 

Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg Gly Ser Ala Trp Arg His 
355 360 365 

Pro Gln Phe Gly Gly 
370 



<210> 20 
<211> 2055 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag 

<220> 

<221> CDS 

<222> (1)..(2049) 

<400> 20 

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48 
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144 
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg 
35 40 45 

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192 
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp 
50 55 60 

tat gcc ctc tac ggg ggc tcg tct tcc gaa gac gac gaa cac ccg gag 240 
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288 
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336 
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

cgc aca ccc acc acc gcc ccc cgg gcc ccc cga acc cag cgg gtg gcg 384 
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala 
115 120 125 

tct aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432 
Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

tcg gcc cag cca gaa tca gcc gca ctc cca gac gcc ccc gcg tcg acg 480 
Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528 
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu 
165 170 175 

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 576 
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624 
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

gcg gcc atg cat gcc cgg atg gcg gct gtc cag ctc tgg gac atg tcg 672 
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser 
210 215 220 

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 720 
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr 
225 230 235 240 

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768 
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn 
245 250 255 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816 
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala 
260 265 270 

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 864 
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912 
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 960 
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His 
305 310 315 320 

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 1008 
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro 
325 330 335 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056 
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104 
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 1152 
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 1200 
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 1248 
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu 
405 410 415 

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 1296 
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 430 

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 1344 
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn 
435 440 445 

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 1392 
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr 
450 455 460 

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1440 
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln 
465 470 475 480 

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1488 
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu 
485 490 495 

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1536 
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr 
500 505 510 

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1584 
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val 
515 520 525 

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1632 
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1680 
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 560 

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1728 
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
565 570 575 

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1776 
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr 
580 585 590 

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1824 
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu 
595 600 605 

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1872 
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1920 
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr 
625 630 635 640 

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1968 
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr 
645 650 655 

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt atc gaa ggt cgt 2016 
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg 
660 665 670 

ggt agc gct tgg cgt cac ccg cag ttc ggt ggt taataa 2055 
Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly 
675 680 



<210> 21 
<211> 683 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag 

<400> 21 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg 
35 40 45 

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp 
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala 
115 120 125 

Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu 
165 170 175 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser 
210 215 220 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr 
225 230 235 240 

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn 
245 250 255 

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala 
260 265 270 

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His 
305 310 315 320 

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro 
325 330 335 

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu 
405 410 415 

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 430 

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn 
435 440 445 

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr 
450 455 460 

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln 
465 470 475 480 

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu 
485 490 495 

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr 
500 505 510 

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val 
515 520 525 

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 560 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
565 570 575 

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr 
580 585 590 

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu 
595 600 605 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr 
625 630 635 640 

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr 
645 650 655 

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg 
660 665 670 

Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly 
675 680 



<210> 22 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 22 

Ala Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg 
1 5 10 



<210> 23 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 
<400> 23 

Tyr Ala Arg Lys Ala Arg Arg Gln Ala Arg Arg 
1 5 10 



<210> 24 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 24 

Tyr Ala Arg Ala Ala Ala Arg Gln Ala Arg Ala 
1 5 10 



<210> 25 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 25 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Arg 
1 5 10



<210> 26 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 26 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Ala 
1 5 10



<210> 27 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 27 

Tyr Ala Arg Arg Arg Arg Arg Arg Arg Arg Arg 
1 5 10



<210> 28 
<211> 11 
<212> PRT 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 28 

Tyr Ala Ala Ala Arg Arg Arg Arg Arg Arg Arg 
1 5 10 
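
For readers who want to compare the three-letter listings of SEQ ID NOs: 22 to 28 with one-letter notation, the following is a minimal Python sketch and not part of the original sequence listing. The names `THREE_TO_ONE`, `TAT_VARIANTS`, `to_one_letter` and `ONE_LETTER` are illustrative assumptions; the residue strings are copied from the entries above, and the mapping is the standard IUPAC abbreviation table.

```python
# Standard IUPAC three-letter to one-letter amino acid abbreviations.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Residues as listed under <400> for SEQ ID NOs: 22 to 28.
TAT_VARIANTS = {
    22: "Ala Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg",
    23: "Tyr Ala Arg Lys Ala Arg Arg Gln Ala Arg Arg",
    24: "Tyr Ala Arg Ala Ala Ala Arg Gln Ala Arg Ala",
    25: "Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Arg",
    26: "Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Ala",
    27: "Tyr Ala Arg Arg Arg Arg Arg Arg Arg Arg Arg",
    28: "Tyr Ala Ala Ala Arg Arg Arg Arg Arg Arg Arg",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a space-separated three-letter residue string to one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


# One-letter form of every listed variant, keyed by SEQ ID number.
ONE_LETTER = {number: to_one_letter(seq) for number, seq in TAT_VARIANTS.items()}

if __name__ == "__main__":
    for number, seq in ONE_LETTER.items():
        print(f"SEQ ID NO: {number}: {seq} "
              f"(length {len(seq)}, Arg count {seq.count('R')})")
```

Each converted string can be checked against the stated length in the corresponding `<211>` field (11 residues for all seven entries).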



<210> 29 
<211> 4960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCMV-I-Cre-pA 

<400> 29 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540 
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 
attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720 
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780 
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840 
ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900 
gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960 
cagggcggcc tcgaccatgc ccaagaagaa gaggaaggtg tccaatttac tgaccgtaca 1020 
ccaaaatttg cctgcattac cggtcgatgc aacgagtgat gaggttcgca agaacctgat 1080 
ggacatgttc agggatcgcc aggcgttttc tgagcatacc tggaaaatgc ttctgtccgt 1140 
ttgccggtcg tgggcggcat ggtgcaagtt gaataaccgg aaatggtttc ccgcagaacc 1200 
tgaagatgtt cgcgattatc ttctatatct tcaggcgcgc ggtctggcag taaaaactat 1260 
ccagcaacat ttgggccagc taaacatgct tcatcgtcgg tccgggctgc cacgaccaag 1320 
tgacagcaat gctgtttcac tggttatgcg gcggatccga aaagaaaacg ttgatgccgg 1380 
tgaacgtgca aaacaggctc tagcgttcga acgcactgat ttcgaccagg ttcgttcact 1440 
catggaaaat agcgatcgct gccaggatat acgtaatctg gcatttctgg ggattgctta 1500 
taacaccctg ttacgtatag ccgaaattgc caggatcagg gttaaagata tctcacgtac 1560 
tgacggtggg agaatgttaa tccatattgg cagaacgaaa acgctggtta gcaccgcagg 1620 
tgtagagaag gcacttagcc tgggggtaac taaactggtc gagcgatgga tttccgtctc 1680 
tggtgtagct gatgatccga ataactacct gttttgccgg gtcagaaaaa atggtgttgc 1740 
cgcgccatct gccaccagcc agctatcaac tcgcgccctg gaagggattt ttgaagcaac 1800
tcatcgattg atttacggcg ctaaggatga ctctggtcag agatacctgg cctggtctgg 1860 
acacagtgcc cgtgtcggag ccgcgcgaga tatggcccgc gctggagttt caataccgga 1920 
gatcatgcaa gctggtggct ggaccaatgt aaatattgtc atgaactata tccgtaacct 1980 
ggatagtgaa acaggggcaa tggtgcgcct gctggaagat ggcgattagc cattaacgcg 2040 
taaatgattg cagatccact agttctaggg ccgcgtcgac ctcgagatcc aggcgcggat 2100 
caataaaaga tcattatttt caatagatct gtgtgttggt tttttgtgtg ccttggggga 2160 
gggggaggcc agaatgaggc gcggccaagg gggaggggga ggccagaatg accttggggg 2220 
agggggaggc cagaatgacc ttgggggagg gggaggccag aatgaggcgc gcccccgggt 2280 
accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 2340 
tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 2400 
ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 2460
gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 2520 
tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga 2580 
cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 2640
cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg 2700
cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc 2760 
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 2820 
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 2880 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 2940 
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 3000 
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 3060 
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 3120 
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 3180 
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 3240 
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 3300
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 3360 
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 3420 
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 3480
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 3540 
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 3600 
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 3660
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 3720 
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 3780
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 3840 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 3900 
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 3960
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 4020 
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 4080 
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 4140 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 4200 
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 4260 
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 4320 
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 4380 
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 4440 
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 4500 
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 4560 
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 4620 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 4680 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 4740 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 4800 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 4860 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 4920 
cgccaagcta gcccgggcta gcttgcatgc ctgcaggttt 4960 



<210> 30 
<211> 7332 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector
pCMV-I-beta-pA 

<400> 30 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660
attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840
ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900
gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960
cagggcggcc gcaattcccg gggatcgaaa gagcctgcta aagcaaaaaa gaagtcacca 1020
tgtcgtttac tttgaccaac aagaacgtga ttttcgttgc cggtctggga ggcattggtc 1080
tggacaccag caaggagctg ctcaagcgcg atcccgtcgt tttacaacgt cgtgactggg 1140
aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc 1200
gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg 1260
aatggcgctt tgcctggttt ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg 1320
atcttcctga ggccgatact gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg 1380
cgcccatcta caccaacgta acctatccca ttacggtcaa tccgccgttt gttcccacgg 1440
agaatccgac gggttgttac tcgctcacat ttaatgttga tgaaagctgg ctacaggaag 1500
gccagacgcg aattattttt gatggcgtta actcggcgtt tcatctgtgg tgcaacgggc 1560
gctgggtcgg ttacggccag gacagtcgtt tgccgtctga atttgacctg agcgcatttt 1620
tacgcgccgg agaaaaccgc ctcgcggtga tggtgctgcg ttggagtgac ggcagttatc 1680
tggaagatca ggatatgtgg cggatgagcg gcattttccg tgacgtctcg ttgctgcata 1740
aaccgactac acaaatcagc gatttccatg ttgccactcg ctttaatgat gatttcagcc 1800
gcgctgtact ggaggctgaa gttcagatgt gcggcgagtt gcgtgactac ctacgggtaa 1860
cagtttcttt atggcagggt gaaacgcagg tcgccagcgg caccgcgcct ttcggcggtg 1920
aaattatcga tgagcgtggt ggttatgccg atcgcgtcac actacgtctg aacgtcgaaa 1980
acccgaaact gtggagcgcc gaaatcccga atctctatcg tgcggtggtt gaactgcaca 2040
ccgccgacgg cacgctgatt gaagcagaag cctgcgatgt cggtttccgc gaggtgcgga 2100
ttgaaaatgg tctgctgctg ctgaacggca agccgttgct gattcgaggc gttaaccgtc 2160
acgagcatca tcctctgcat ggtcaggtca tggatgagca gacgatggtg caggatatcc 2220
tgctgatgaa gcagaacaac tttaacgccg tgcgctgttc gcattatccg aaccatccgc 2280
tgtggtacac gctgtgcgac cgctacggcc tgtatgtggt ggatgaagcc aatattgaaa 2340
cccacggcat ggtgccaatg aatcgtctga ccgatgatcc gcgctggcta ccggcgatga 2400
gcgaacgcgt aacgcgaatg gtgcagcgcg atcgtaatca cccgagtgtg atcatctggt 2460
cgctggggaa tgaatcaggc cacggcgcta atcacgacgc gctgtatcgc tggatcaaat 2520
ctgtcgatcc ttcccgcccg gtgcagtatg aaggcggcgg agccgacacc acggccaccg 2580
atattatttg cccgatgtac gcgcgcgtgg atgaagacca gcccttcccg gctgtgccga 2640
aatggtccat caaaaaatgg ctttcgctac ctggagagac gcgcccgctg atcctttgcg 2700
aatacgccca cgcgatgggt aacagtcttg gcggtttcgc taaatactgg caggcgtttc 2760
gtcagtatcc ccgtttacag ggcggcttcg tctgggactg ggtggatcag tcgctgatta 2820
aatatgatga aaacggcaac ccgtggtcgg cttacggcgg tgattttggc gatacgccga 2880
acgatcgcca gttctgtatg aacggtctgg tctttgccga ccgcacgccg catccagcgc 2940
tgacggaagc aaaacaccag cagcagtttt tccagttccg tttatccggg caaaccatcg 3000
aagtgaccag cgaatacctg ttccgtcata gcgataacga gctcctgcac tggatggtgg 3060
cgctggatgg taagccgctg gcaagcggtg aagtgcctct ggatgtcgct ccacaaggta 3120
aacagttgat tgaactgcct gaactaccgc agccggagag cgccgggcaa ctctggctca 3180
cagtacgcgt agtgcaaccg aacgcgaccg catggtcaga agccgggcac atcagcgcct 3240
ggcagcagtg gcgtctggcg gaaaacctca gtgtgacgct ccccgccgcg tcccacgcca 3300
tcccgcatct gaccaccagc gaaatggatt tttgcatcga gctgggtaat aagcgttggc 3360
aatttaaccg ccagtcaggc tttctttcac agatgtggat tggcgataaa aaacaactgc 3420
tgacgccgct gcgcgatcag ttcacccgtg caccgctgga taacgacatt ggcgtaagtg 3480
aagcgacccg cattgaccct aacgcctggg tcgaacgctg gaaggcggcg ggccattacc 3540
aggccgaagc agcgttgttg cagtgcacgg cagatacact tgctgatgcg gtgctgatta 3600
cgaccgctca cgcgtggcag catcagggga aaaccttatt tatcagccgg aaaacctacc 3660
ggattgatgg tagtggtcaa atggcgatta ccgttgatgt tgaagtggcg agcgatacac 3720
cgcatccggc gcggattggc ctgaactgcc agctggcgca ggtagcagag cgggtaaact 3780
ggctcggatt agggccgcaa gaaaactatc ccgaccgcct tactgccgcc tgttttgacc 3840
gctgggatct gccattgtca gacatgtata ccccgtacgt cttcccgagc gaaaacggtc 3900
tgcgctgcgg gacgcgcgaa ttgaattatg gcccacacca gtggcgcggc gacttccagt 3960
tcaacatcag ccgctacagt caacagcaac tgatggaaac cagccatcgc catctgctgc 4020
acgcggaaga aggcacatgg ctgaatatcg acggtttcca tatggggatt ggtggcgacg 4080
actcctggag cccgtcagta tcggcggaat tacagctgag cgccggtcgc taccattacc 4140
agttggtctg gtgtcaaaaa taataataac cgggcaggcc atgtctgccc gtatttcgcg 4200
taaggaaatc cattatgtac tatttaaaaa acacaaactt ttggatgttc ggtttattct 4260
ttttctttta cttttttatc atgggagcct acttcccgtt tttcccgatt tggctacatg 4320
acatcaacca tatcagcaaa agtgatacgg gtattatttt tgccgctatt tctctgttct 4380
cgctattatt ccaaccgctg tttggtctgc tttctgacaa actcggcctc gactctaggc 4440
ggccgcgtcg acctcgagat ccaggcgcgg atcaataaaa gatcattatt ttcaatagat 4500
ctgtgtgttg gttttttgtg tgccttgggg gagggggagg ccagaatgag gcgcggccaa 4560
gggggagggg gaggccagaa tgaccttggg ggagggggag gccagaatga ccttggggga 4620
gggggaggcc agaatgaggc gcgcccccgg gtaccgagct cgaattcact ggccgtcgtt 4680
ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat 4740
ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag 4800 
ttgcgcagcc tgaatggcga atggcgcctg atgcggtatt ttctccttac gcatctgtgc 4860 
ggtatttcac accgcatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta 4920
agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg 4980 
gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca 5040 
ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt 5100 
aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc 5160 
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 5220 
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 5280 
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 5340
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 5400 
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 5460
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 5520 
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 5580 
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 5640 
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 5700 
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 5760
ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca 5820 
acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata 5880 
gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc 5940
tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca 6000 
ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca 6060 
actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg 6120 
taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa 6180
tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt 6240 
gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat 6300
cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 6360 
gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga 6420 
gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac 6480 
tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt 6540 
ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag 6600 
cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc 6660 
gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag 6720 
gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca 6780 
gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt 6840 
cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc 6900 
tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc 6960 
cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc 7020 
cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa 7080 
ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 7140
tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc 7200 
caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa 7260 
tttcacacag gaaacagcta tgaccatgat tacgccaagc tagcccgggc tagcttgcat 7320 
gcctgcaggt tt 7332 



<210> 31 
<211> 72 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

atgccatggg ctacggccgc aagaagcgcc gccaacgccg ccgcggcatg tccaatttac 60 
tgaccgtaca cc 72
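
As an illustration of how a listing entry can be sanity-checked, the sketch below strips the running position counters from SEQ ID NO: 31, compares the base count with the `<211>` value of 72, and translates the primer with the standard genetic code. This is not part of the original listing. The helper names (`parse_listing`, `translate`, `CODON_TABLE`) are hypothetical, and the choice of reading frame, starting at the ATG at position 6 inside the apparent NcoI site, is an assumption made for illustration.

```python
import re

# Standard genetic code, codons ordered t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# SEQ ID NO: 31 as printed under <400> 31, including position counters.
SEQ_31_LISTING = """
atgccatggg ctacggccgc aagaagcgcc gccaacgccg ccgcggcatg tccaatttac 60
tgaccgtaca cc 72
"""


def parse_listing(text: str) -> str:
    """Drop whitespace and position counters, keeping only base letters."""
    return re.sub(r"[^acgtn]", "", text.lower())


def translate(dna: str, start: int = 0) -> str:
    """Translate complete codons from `start`; unknown codons become 'X'."""
    codons = (dna[i:i + 3] for i in range(start, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


SEQ_31 = parse_listing(SEQ_31_LISTING)
# Assumed frame: the ATG beginning at 0-based index 5.
PEPTIDE_31 = translate(SEQ_31, start=5)

if __name__ == "__main__":
    print(len(SEQ_31), PEPTIDE_31)
```

Under the assumed frame, the translated peptide contains the YGRKKRRQRRR motif recited in claim 5 (i), followed by the first residues of the Cre coding sequence.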



<210> 32 
<211> 25 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 32 

tttcggatcc gccgcataac cagtg 
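
The following sketch, which is not part of the original listing, shows one way to check where a primer could anneal. It assumes that SEQ ID NO: 32 is a reverse primer, takes its reverse complement, and searches for it in a 40-base excerpt copied from positions 1331 to 1370 of SEQ ID NO: 29. The names `reverse_complement`, `PRIMER_32` and `CRE_EXCERPT` are illustrative only.

```python
# Translation table for complementing lowercase DNA bases.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


# SEQ ID NO: 32 with the grouping spaces removed (25 bases).
PRIMER_32 = "tttcggatccgccgcataaccagtg"

# Excerpt of SEQ ID NO: 29 (pCMV-I-Cre-pA), positions 1331 to 1370.
CRE_EXCERPT = "gctgtttcactggttatgcggcggatccgaaaagaaaacg"

SITE = reverse_complement(PRIMER_32)
# 0-based offset of the match in the excerpt, or -1 if there is none.
OFFSET = CRE_EXCERPT.find(SITE)

if __name__ == "__main__":
    print(SITE, OFFSET)
```

A non-negative offset indicates that the primer is fully complementary to the top strand of the excerpt, which is consistent with, but does not prove, its use as a reverse primer on the Cre coding region.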



<210> 33 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 33 

tatatctaga ccatgggcta cggccgcaag aagc



<210> 34 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 34 

gctaccacga ccttcgatac catcgccatc ttccagcagg cgc 



<210> 35 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

taactagcgg ccgcatgtcc aatttactga ccgtacac



<210> 36 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

tcgagcggcc gccatcgcca tcttccagca ggcg 



<210> 37 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 37 

tatatctaga catatgacct ctcgccgctc cg 32



<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

ttccgaagac gacgaaacac c 



<210> 39 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

tatattcgaa gcttattaac caccgaactg cg



<210> 40 
<211> 4847 
<212> DNA

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: vector 
pGK-cre-pA 

<400> 40 

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 60
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 120 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 180 
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 240
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 300
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 360 
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 420
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 480 
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 540 
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 600 
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 660
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 720
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 780 
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 840
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 900 
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 960 
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 1020
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 1080 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 1140
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 1200 
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 1260 
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 1320 
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 1380 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 1440 
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 1500
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 1560 
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 1620 
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 1680
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 1740
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 1800 
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 1860 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 1920 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 1980 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 2040 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 2100 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 2160 
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctgggtaccg ggccccccct 2220 
cgaggtcgac ggtatcgata agcttgatat cgaattctac cgggtagggg aggcgctttt 2280 
cccaaggcag tctggagcat gcgctttagc agccccgctg gcacttggcg ctacacaagt 2340 
ggcctctggc ctcgcacaca ttccacatcc accggtagcg ccaaccggct ccgttctttg 2400 
gtggcccctt cgcgccactt ctactcctcc cctagtcagg aagtttcccc cagcaagctc 2460
gcgtcgtgca ggacgtgaca aatggaagta gcacgtctca ctagtctcgt gcagatggac 2520 
agcaccgctg agcaatggaa gcgggtaggc ctttggggca gcggccaata gcagctttgt 2580 
tccttcgctt tctgggctca gaggctggga aggggtgggt ccgggggcgg gctcaggggc 2640 
gggctcaggg gcgggcgggc gcccgaaggt cctcccgagg cccggcattc tgcacgcttc 2700 
aaaagcgcac gtctgccgcg ctgttctcct cttcctcatc tccgggcctt tcgacctgca 2760
gctcgaggtc gaccatgccc aagaagaaga ggaaggtgtc caatttactg accgtacacc 2820 
aaaatttgcc tgcattaccg gtcgatgcaa cgagtgatga ggttcgcaag aacctgatgg 2880 
acatgttcag ggatcgccag gcgttttctg agcatacctg gaaaatgctt ctgtccgttt 2940 
gccggtcgtg ggcggcatgg tgcaagttga ataaccggaa atggtttccc gcagaacctg 3000 
aagatgttcg cgattatctt ctatatcttc aggcgcgcgg tctggcagta aaaactatcc 3060 
agcaacattt gggccagcta aacatgcttc atcgtcggtc cgggctgcca cgaccaagtg 3120 
acagcaatgc tgtttcactg gttatgcggc ggatccgaaa agaaaacgtt gatgccggtg 3180 
aacgtgcaaa acaggctcta gcgttcgaac gcactgattt cgaccaggtt cgttcactca 3240 
tggaaaatag cgatcgctgc caggatatac gtaatctggc atttctgggg attgcttata 3300 
acaccctgtt acgtatagcc gaaattgcca ggatcagggt taaagatatc tcacgtactg 3360 
acggtgggag aatgttaatc catattggca gaacgaaaac gctggttagc accgcaggtg 3420 
tagagaaggc acttagcctg ggggtaacta aactggtcga gcgatggatt tccgtctctg 3480 
gtgtagctga tgatccgaat aactacctgt tttgccgggt cagaaaaaat ggtgttgccg 3540 
cgccatctgc caccagccag ctatcaactc gcgccctgga agggattttt gaagcaactc 3600 
atcgattgat ttacggcgct aaggatgact ctggtcagag atacctggcc tggtctggac 3660 
acagtgcccg tgtcggagcc gcgcgagata tggcccgcgc tggagtttca ataccggaga 3720 
tcatgcaagc tggtggctgg accaatgtaa atattgtcat gaactatatc cgtaacctgg 3780 
atagtgaaat aggggcaatg gtgcgcctgc tggaagatgg cgattagcca ttaacgcgta 3840 
aatgattgca gatccactag ttctagagct cgctgatcag cctcgactgt gccttctagt 3900 
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 3960 
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4020 
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4080 
aggcatgctg gggatgcggt gggctctatg gcttctgagn nngaaagaac cagctggggc 4140 
tcgagatcca ctagttctag cctcgaggct agagcggccg ccaccgcggt ggagctccaa 4200 
ttcgccctat agtgagtcgt attacgcgcg ctcactggcc gtcgttttac aacgtcgtga 4260 
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 4320 
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 4380 
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 4440 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 4500 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 4560 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 4620 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 4680 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 4740 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 4800 
acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaattt 4847


<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer 
<400> 41 

catctccggg cctttcgacc tg 22 



<210> 42 
<211> 21 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

gcgatcggtg cgggcctctt c 21 

Claims

1. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in an endogenous gene. 

2. The use of claim 1, wherein the PTD is not derived from Antennapedia
and preferably is a PTD derived from the VP22 protein of HSV or from the
TAT protein of HIV.

3. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) being not derived from
Antennapedia and preferably being derived from the VP22 protein of HSV
or from the TAT protein of HIV

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in its genome. 

4. The use of claim 3, wherein the recognition sites for said site-specific
recombinase are present within an endogenous gene or a transgene.

5. The use of any one of claims 2 to 4, wherein the TAT protein comprises 

(i) the amino acid sequence YGRKKRRQRRR (SEQ ID NO: 10) or a mutant 
thereof including 

(ii) peptides having the amino acid sequences
AGRKKRRQRRR (SEQ ID NO: 22)
YARKARRQARR (SEQ ID NO: 23)
YARAAARQARA (SEQ ID NO: 24)
YARAARRAARR (SEQ ID NO: 25)
YARAARRAARA (SEQ ID NO: 26)
YARRRRRRRRR (SEQ ID NO: 27)
YAAARRRRRRR (SEQ ID NO: 28);

preferably the TAT protein consists of one of the sequences shown in (i) or
(ii) above.

6. The use of any one of claims 2 to 4, wherein the VP22 protein
comprises amino acids 16 to 157 of SEQ ID NO: 14.

7. The use of any one of claims 1 to 6, wherein the site-specific DNA
recombinase domain is selected from a recombinase protein derived from
Cre, Flp, φC31 recombinase, and R recombinase and preferably is Cre
having amino acids 15 to 357 of SEQ ID NO: 2 or Flpe having amino acids
15 to 437 of SEQ ID NO: 4.

8. The use of any one of claims 1 to 7, wherein the protein transduction 
domain is fused to the N-terminal of the site-specific DNA recombinase 
domain. 

9. The use of any one of claims 1 to 8, wherein the protein transduction
domain is fused to the site-specific DNA recombinase domain through a
direct chemical bond or through a linker molecule.

10. The use of claim 9, wherein the linker molecule is a short
peptide having 1 to 20, preferably 1 to 10 amino acid residues.

11. The use of any one of claims 1 to 10, wherein said fusion protein
further comprises additional functional sequences. 

12. The use of claim 1, wherein the fusion protein has the sequence 
shown in SEQ ID NOs: 2, 4, 6 or 8. 

13. The use of any one of claims 1 to 12, wherein the living organism is a 
vertebrate, preferably a rodent or a fish. 

14. A method for inducing gene alterations in a living organism which
comprises administering to said living organism a fusion protein
comprising a site-specific DNA recombinase domain and a protein
transduction domain as defined in claims 1 to 12, wherein said living
organism carries at least one or more recognition sites for said
site-specific DNA recombinase integrated in its genome.

15. A fusion protein comprising 

(a) a site-specific DNA recombinase domain as defined in claims 2 to 9 
and 

(b) a protein transduction domain (PTD) as defined in claims 2 to 9
provided that when (a) is the wild-type Flp or Cre then (b) is not the full 
length VP22 protein of HSV. 

16. The fusion protein of claim 15, wherein the PTD is derived from the TAT
protein of HIV.

17. A DNA sequence coding for the fusion protein of claim 15 or 16, said
DNA sequence preferably comprising the sequence shown in SEQ ID
NOs: 1, 3, 5, 7, 9, 11, 13, 18 and/or 20.



18. A vector comprising the DNA sequence of claim 17. 

19. A host cell transformed with the vector of claim 18 and/or comprising
the DNA of claim 17. 

20. A method for producing the fusion protein of claim 15 which comprises 
culturing the transformed host cell of claim 19 and isolating the fusion
protein. 

21. An injectable composition comprising the fusion protein as defined in 
claims 1 to 12 or 15 to 16. 

Fig. 1
Fig. 3
Fig. 5
Fig. 6
Fig. 7
Fig. 10